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ABSTRACT 

This paper describes a system for the computer 
understanding of English. The system answers questions, 
executes commands, and accepts information In normal English 
dialog. It uses semantic Information and context to understand 
discourse and to disambiguate sentences. It combines a complete 
syntactic analysis of each sentence with a "heuristic 
understander" which i/ses different kinds of information about a 
sentence, other parts of the discourse, and general information 
about the world in deciding what the sentence means. 

It is based on the belief that a computer cannot deal 
reasonably with language unless It can "understand 11 the subject 
It Is discussing. The program Is given a detailed model of the 
knowledge needed by a simple robot having only a hand and an 
eye. We can give it Instructions to manipulate toy objects. 
Interrogate It about the scene, and give It Information it will 
use in deduction. In addition to knowing the properties of toy 
objects, the program has a simple model of Its own mentality. 
It can remember and discuss its plans and actions as well as 
carry them out. It enters into a dialog with a person, 
responding to English sentences with actions and English 
replies, and asking for clarification when its heuristic 
programs cannot understand a sentence through use of context and 
phys ical knowledge. 

In the programs, syntax, semantics and inference are 
Integrated In a "vertical" system In which each part is 
constantly corrmuni eating with the others. We have explored 
several techniques for integrating the large bodies of complex 
knowledge needed to understand language. We use Systemic 
Grammar, a type of syntactic analysis which Is designed to deal 
with semantics. Rather than concentrating on the exact form of 
rules for the shapes of linguistic constituents. It is 
structured around choices for conveying meaning. It abstracts 
the relevant features of the linguistic structures which are 
important for interpreting their meaning. 

We represent many kinds of knowledge in the form of 
procedures rather than tables of rules or lists of patterns. By 
devel oping spec ial procedural 1 anguages for grammar, seman tics, 
and deductive logic, we gain the flexibility and power of 
programming languages while retaining the regularity and 
unders tandahi 1 1 1 y of simpler rul*? forms. Each piece of 
knowledge can he 3 procedure, and can call on any other piece of 
knowledge in the system. 

Thesis Supervisor: Seymour A. Papprt, Professor of Applied 
Mathemat ics 



Note on the Organization of the Text 

This paper was written to be readable at several different 
levels of detail. The Preface Is Intended to be understandable 
to a layman with no special knowledge of linguistics or 
computers, and gives a general idea of the purposes and methods. 
The Introduction gives somewhat more detail, along with a sample 
of a dialog with the program* It explains more specifically how 
the program is organized, and what theories were used in Its 
construction. 

The remaining chapters each contain a general Introductory 
section, followed by further sections explaining the details of 
the programs and theories. It should be possible to get a good 
basic understanding of the paper by reading the introduction, 
followed by the first section of each chapter. In addition, 
there was an attempt to keep sections independent so that parts 
of the paper could be selected separately. For example, the 
description of the PROGRAMMAR language, and the description of 
our grammar of English can be read Independently. 

Because of this structure, some parts of the paper are 
redundant — a particularly Important principle or relevant 
example may be repeated at all three levels of explanation, and 
In the different sections where It Is needed. However It Is 
hoped that It will allow the reader to go Into the content as 
deeply as he wants without getting bogged down in detail. 
Throughout the text, single-spaced paragraphs are used for 
emphasis. 
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Preface — Talking to Computers 
Computers are being used today to take over many of our 
jobs- They can perform millions of calculations In a second, 
handle mountains of data, and perform routine office work much 
more efficiently and accurately than humans. But when It comes 
to telling them what to do, they are tyrants. They Insist on 
being spoken to In special computer languages, and act as though 
they can't even understand a simple English sentence. 

Let us envision a new way of using computers so they can 
take Instructions in a way suited to their Jobs. We wilt talk 
to them just as we talk to 3 research assistant, librarian, or 
secretary, and they will carry out our commands and provide us 
with the information we ask for. If our Instructions aren't 
clear enough, they will ask for more information before they do 
what we want, and this dialog will all be In English. 
Why isn't this being done now? Aren't computers 
translating foreign languages and conducting psychiatric 
Interviews? Surely It must be easier to understand simple 
requests for information than to understand Russian or a 
person's psychological problems. The key to this question Is In 
understanding what we mean by "understanding". Computers are 
very adept at manipulating symbols -- at shuffling around 
strings of letters and words, looking them up in dictionaries, 
and rearranging them. In the early days of computing, some 
people thought that simple applications of these capabilities 
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might be just what was needed to translate languages. The 

government supported a tremendous amount of research into 

language translation, and a number of projects tried different 

approaches. In 1966 a corrmlttee of the National Academy of 

Sciences wrote a report evaluating this research and announced 

sadly that it had been a failure* Every project ran up against 

the same brick wall — the computer didn't know what tt was 

tal king about . 

When a human reader sees a sentence, he uses knowledge to 
understand It. This includes not only grammar, but also his 
knowledge about words, the context of the sentence, and most 
important, his knowledge about the subject matter. A computer 
program supplied with only a grammar for manipulating the syntax 
of language could not produce a translation of reasonable 
qual i ty. 

Everyone has heard the story of the computer thai tried to 

translate "The spirit is willing but the flesh is weak. 1 ' into 

Russian and came out with something which meant "The vodka Is 

strong but the meat is rotten." Unfortunately the problem Is 

much more serious than just choosing the wrong words when 

translating idioms- it Isn't always possible to even choose the 

right grammatical forms. We may want to translate the two 

sentences "A message was delivered by the next visitor," and "A 

message was delivered by the next day." If we are translating 

Into a language which doesn't have the equivalent of our 

"passive voice", we may need to completely rearrange the first 

sentence Into something corresponding to "The next visitor 

delivered a message." The other sentence might become something 
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like "Before the next day, someone delivered a message*" If the 
computer picks the wrong form for either sentence, the meaning 
Is totally garbled. In order to make the choice. It has to know 
that visitors are people who can deliver messages, while days 
are units of time and cannot. It has to "understand" the 
meanings of the words "day" and "visitor". 

In other cases the problem is even worse. Even a knowledge 
of the meanings of words Is not enough. Let us try to translate 
the two sentences: 

"The city councllmen refused to give the women a 

perm 1 1 for a demon st rat Ion because they feared violence," 

and 

"The city councilmen refused to give the women a 

permit for a demonstration because they advocated 

revolut ion," 

If we are translating into a language (like French) which 
has different forms of the word "they" for masculine and 
feminine, we cannot leave the reader to figure out who "they" 
refers to. The computer must make a choice and if It chooses 
wrong, the meaning of the sentence Is changed. To make the 
decision. It has to have more than the meanings of words. It 
has to have the information and reasoning power to realize that 
city councllmen are usually staunch advocates of law and order, 
but are hardly likely to be revolutionaries. 

For some uses. It Isn't really necessary to understand 
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much. There has been much publicity about a well known 

"psychiatrist" program named ELIZA. It imitates the kind of 

Rogerlan psychiatrist who would respond to a Question like "What 

time Is It?" by asking "Why do you want to know what time It 

Is?" or muttering "You want to know what time It is!". This can 

be done without much understanding- All It needs to do Is take 

the words of the question and rearrange them In some simple way 

to make a new question or statement. In addition It recognizes 

a few key words, to respond with a fixed phrase whenever the 

patient uses one of them. If the patient types a sentence 

containing the word "mother", the program can say "Tell me more 

about your family! 11 . In fact, this Is just how the psychiatrist 

program works. But very often It doesn't work « Its answers 

are silly or meaningless because It isn't really understanding 

the content of what Is being said. 

If we really want computers to understand us, we need to 
give them the ability to use more knowledge. In addition to a 
grammar of the language, they need to have all sorts of 
knowledge about the subject they are discussing, and they have 
to use reasoning to combine facts in the right way to understand 
a sentence and respond to it. The process of understanding a 
sentence has to combine grarrmar, semantics, and reasoning In a 
verv intimate way, calling on each part to help with the others. 

This thesis explores one way of giving the computer 

knowledge In a flexible and usable form. In addition to basic 

tools and operations for understanding language, we Rive the 

computer special (zed Informa t Ion about the Fngl Ish language, the 

words we win use, and the subject we will discuss. In most 

earl ier computer programs for understanding language, there have 
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been attempts to use these kinds of Information In the form of 

lists of rules, patterns, and formulas . 

In our system, knowledge Is expressed as orozrams In 
spec ial languages des Igned for syntax, semantics, and reasoning. 
These languages have the control structure of a programming 
language* with the statements of the language explicitly 
controlling the process. This makes It possible to relate the 
different areas of knowledge more directly and completely. The 
course of the understanding process can be determined directly 
by special knowledge about a word, a syntactic construction, or 
a particular fact about the world. 

This gives greater flexibility than a program with a fixed 
control structure. In which the specific knowledge can only 
Indirectly control the process of understanding. By using 
languages specially developed for representing these kinds of 
knowledge. It Is possible for a person tn "teach" the computer 
what It needs to know about a new subject or a new vocabulary 
without being concerned with the details of how the computer 
will go about using the knowledge to understand language. For 
simple Information, It Is even possible to just "tell" the 
computer In English, Other systems make It possible to "tell" 
the computer new things by allowing It to accept only very 
specialized kinds of Information. By representing Information 
as programs, we can greatly expand the range of things which can 
be Included. 

The best way to experiment with such Ideas Is to write a 
working program which can actually understand language. We 
would like a program which can answer questions, carry out 
commands, and accept new Information In English. If we really 
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want it to understand language, -we must give It knowledge about 

the specific subject we want to talk about* 

For our experiment, we pretended that we were talking to a 
simple robot, with a hand and an eye and the ability to 
manipulate toy blocks on a table. We can say, "Pick up a block 
which is bigger than the one you are holding and put It In the 
box.' 1 , or ask a sequence of questions like "Had you touched any 
pyramid before you put the green one on the little cube?" "When 
did you pick It up?" "Why?", or v/e can give ft new Information 
like "I like blocks which are not red, but I don't like anything 
which supports a pyramid," The "robot" responds by carrying out 
the commands (In a simulated scene on a display screen attached 
to the computer), typing out answers to the questions, and 
accepting the information to use In reasoning later on* 

The dialog is carried out by typing on a terminal attached 
to the computer time-sharing system. There arp a number of hard 
technical problems in getting a computer to communicate by 
voice, and It has not been attempted. 

We had three main kinds of coals In writing such a program. 
The first Is the practical goal of having a language- 
understanding system, fven though we used the robot as our test 
area, the language programs do not depend on any special subject 
matter, and they have been adapted to other uses. 

The second goal is gaining a better understanding of what 
language is and how it is put together. To write a program we 
need to make all of our knowledge about language very explicit, 
and we have to be concerned with the entire language process, 
not just one area such as syntax. We need the most advanced 
theories which linguists and others have developed, and we must 
fit them together to get the program working. This provides a 
rigid tost for linguistic theories, and leads us into making new 
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theories to fill the places where the old ones are lacking. To 

make a computer understand language, we have to Increase our 

knowledge of how people understand language, and one Important 

goal Is the Improvement of our comprehension of language and 

1 Ingulstlcs. 

More generally, we want to understand what Intelligence Is 
and how It can be put Into computers. Language Is one of the 
most complex and unique of human activities, and understanding 
Its structure may lead to a better theory of how our minds work* 
The techniques needed to write a language-understanding program 
may be useful In many other areas of Intelligence such as 
vision, mathematical problem solving, and game playing. All of 
these areas have similar problems of Integrating large amounts 
of knowledge Into a flexible system. 

With advances In artificial Intelligence we will some day 

be able to combine all of these areas to provide computers which 

ar^ not tyrants, but can understand what we tell them and do 

what we want them to In an Intelligent way. 
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Chapter I — Introduction 

1.1 General Description 
This paper describes a system for the understanding of 
English by a computer. The system answers Questions, executes 
cormiands, and accepts Information In normal English dialog. It 
uses semantic Information and context to understand discourse 
and to disambiguate sentences both syntactically and 
semantical 1 y. It combines a complete syntactic analysis of each 
sentence with a "heuristic understander 11 which uses different 
kinds of Information about a sentence/ other parts of the 
discourse, and general Information about the world In deciding 
what a sentence means. 

It Is based on the belief that a computer cannot deal 
reasonably with language unless It can "understand" the subject 
It is discussing. The program Is given a detailed model of a 
small part of knowledge, and It can understand sentences 
concerning the subject It knows about. We use as an example the 
knowledge needed by a simple robot conslslstlng of only an eye 
and a hand. We can give It Instructions to manipulate toy 
objects. Interrogate It about the scene, and give It Information 
It will use In deduction. 

In addition to knowing the properties of blocks, hands, 
etc., the program has a simple model of Its own mentality. It 
can remember and discuss Its plans and actions as well as carry 
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them out. It enters Into a true dialog with a person, 
responding to English sentences with actions and English 
replies, and asking for clarification when Its heuristic 
programs cannot disambiguate a sentence through use of context 
or physical knowledge. 
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1.1*1 Vi Mt Ik Language ? 

To write a computer program which understands natural 
language, we need to understand what language Is and what It 
does. It should be approached not as a set of mathematical 
rules and symbols, but as a system Intended to communicate Ideas 
from a speaker to a hearer, and we want to analyze how It 
achieves that communication. It can be viewed as a process of 
translation from a structure of "concepts' 1 In the mind of the 
speaker. Into a string of sounds or written marks, and back Into 
concepts In the mind of the hearer. 

In order to talk about concepts, we must understand the 
Importance of mental models (see <Mlnsky 1965>). In the flood 
of data pouring Into our brains every moment, people recognize 
regular and recurrent patterns. From these we set up a model of 
the world which serves as a framework In which to organize our 
thoughts. We abstract the presence of particular objects, 
having properties, and entering Into events and relationships. 
Our thinking Is a process of manipulating the "concepts" which 
make up this model. Of course, there Is no way of actually 
observing the Internal workings of a person's mind, but In 
Section 3.1 we will discuss the justification for postulating 
such a "model" In analyzing the human use of language. In 
Section 3.b we show what this model might look like for a small 
area of knowledge, and describe how It can be used for 
reasoning. 
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When we communicate with others, we select concepts and 
patterns from the model and map them onto patterns of sound, 
which are then reinterpreted by the hearer In terms of his own 
model. A theory can concentrate on either half of this process 
of generation and Interpretation of language. Even though a 
complete theory must account for both, Its approach Is strongly 
colored by which one It views as logically primary. Most 
current theories are "generat I ve", but It seems more Interesting 
to look at the Interpretive side (see <Wlnograd 1969> for a 
discussion of the Issues Involved). The first task a child 
faces Is understanding rather than producing language, and he 
understands many utterances before he can speak any. At every 
stage of development, a person can understand a much wider range 
of patterns than he produces tsec <M I 1 ler>. Chapter 7). A 
program Is not a detailed psychological theory of how a person 
Interprets language, but there may In fact be very Informative 
parallels, and at a high level. It may be a reasonable 
s imulat Ion. 

Language understanding Is 4 kind of intellectual activity. 
In which a pattern of sounds or written marks Is Interpreted 
Into a structure of concepts In the mind of the Interpreter. We 
cannot think of It as being done In simple steps: 1. Parse; 2. 
Understand the meaning; 3, Think about the meaning. The way we 
parse a sentence Is controlled by a continuing semantic 
Interpretation which guides us In a "meaningful" direction. 
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When we see the sentence "He gave the boy plants to water* 11 
we don't get tangled up In an Interpretation which would be 
parallel to "He gave the house plants to charity/' The phrase 
"boy plants" doesn't make sense like "house plants" or "bov 
scouts", so we reject any parsing which would use It. 

Syntax, semantics, and Inference must be Integrated In a 
close way, so that they can share In the responsibility for 
Interpretation. Our program muse Incorporate the flexibility 
needed for this kind of "vertical" system In which each part Is 
constantly talking to the others. We have explored several 
techniques for Integrating the large bodies of complex knowledge 
needed to understand language. Two are particularly Important. 

First, we use a type of syntactic analysis which Is 
designed to deal with questions of semantics. Rather than 
concentrating on the exact form of rules for shuffling around 
linguistic symbols. It studies the way language Is structured 
around choices for conveying meaning. The parsing of a sentence 
Indicates Its detailed structure, but more Important It 
abstracts the "features" of the linguistic components which are 
Important for Interpreting their meaning. The syntactic theory 
includes an analysis of the way language Is structured to convey 
Information through systematic choices of features. The other 
parts of the program can look directly at these relevant 
features, rather than having to deal with minor details of the 
way the parsing tree looks. 
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Second, we represent knowledge In the form of procedures 
rather than tables of rules or 1 ists of patterns, 8y developing 
special procedural languages for grammar, semantics, and 
deductive logic, we gain the flexibility and power of programs 
while retaining the regularity and understandlbl 1 I ty of simpler 
rule forms. Since each piece of knowledge can be a procedure. 
It can call on any other piece of knowledge of any type. The 
parser can call semantic routines to see whether the line of 
parsing It Is following makes any sense, and the semantic 
routines can call deductive programs Co see whether a particular 
phrase makes sense In the current context. This Is particularly 
Important In handling discourse, where the Interpretation of a 
sentence containing such things as pronouns may depend In 
complex ways on the preceding discourse and knowledge of the 
subject matter. 

This dual view of programs as data and data as programs 
woul d not have been possible In trad 1 1 lonal programing 
languages. The special languages for expressing facts about 
grammar, semantics, and deduction are embedded In LISP, and 
share with It the capability of Ignoring the artificial 
distinction between programs and data. 
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1-1-2 Organization &£ jji£ LARRupse UNCI'S tflntfl flft PrPC»S 

We can divide the process of language understanding Into 

three main areas — syntax, semantics, and Inference. As 
mentioned above, these areas cannot be viewed separately but 
must be understood as part of an Integrated system. 
Nevertheless, we have organized our programs along these basic 

lines, since each area has Its own tools and concepts which make 

It useful to write special programs for It. 

Listing these aspects of language understanding separa tel y 
is somewhat misleading, as It Is the Interconnection and 
Interplay between them which makes the system possible. Our 
parser does not parse a sentence, then hand It off to an 
Interpreter, As It finds each piece of the syntactic 
structure. It checks Its semantic Intepretat Ion, first to see If 
It Is plausible, then (If possible) to see If It Is In accord 
with the system's knowledge of the world, both specific and 
general. This has been done In a limited way by other systems, 
but In our program It Is an Integral part of understanding at 
every level . 

A. Syntax 

First we need a system for the syntactic analysis of Input 

sentences, and any phrases and other non-sentences we might want 

In our dialogs. There have been many different parsing systems 

developed by different language projects, each based on a 

particular theory of granuiar. The tvoe flf grafmar chosen plays 

a major role In the type of semantic analysis which can be 

carried out. A language named PROGRAMMArt was designed 

specifically to fit the type of analysis used In this system. 

It differs from other parsers In that the grammar Itself Is 

written In the form of a collection of programs, and the parsing 
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system Is In effect an Interpreter for the language used In 

writing those programs. 

Having chosen the "type of Grammar", we need to formalize a 

grammar for parsing sentences In a particular language. Our 

system Includes a comprehens I ve .grammar of Engl Ish fol lowing the 

lines of systemic grammar (see Section 2*3). This type of 

grammar Is well suited to a complete language-understanding 

system since It views language as a system for conveying meaning 

and Is highly oriented toward semantic analysis. It Is Intended 

to cover a wide range of syntactic constructions; one basic 

criterion for the completeness of the grammar Is that a person 

with no knowledge of the system or Its grammar should be able to 

type any reasonable sentence within the limitations of the 

vocabulary and expect It to be understood. 

B. Inference 

At the other end of the linguistic process we need a 
deductive system which can be used not only for such things as 
resolving ambiguities and answering questions, but also to allow 
the parser to use deduction In trying to parse a sentence* The 
system uses PLANNER, a deductive system designed by Carl Hewitt 
(see <HewItt 1969, 1970>) which Is based on a philosophy very 
similar to the general mood of this project. Deduction In 
PLANNER is not carried out In the traditional "logistic 
framework" In which a general procedure acts on a set of axioms 
or theorems expressed in a formal system of logic. Instead, 
each theorem Is In the form of a program, and the deductive 
process can be directed to any desired extent by "Intelligent 
theorems." PLANNER Is actually a language for the writing of 
chose theorems. 

This deductive system must be given a model of the world, 

with the concepts and knowledge needed to make Its deductions. 

Useful language-understanding can occur only when a program (or 
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person) has an adequate understanding of the subject he Is 
talking about. We will not attempt to understand arbitrary 
sentences talking about unknown subjects, but Instead will give 
the system detailed knowledge about a particular subject " In 
this case, the simple robot world of children's toy blocks and 
some other common objects. The deductive system has a double 
task of solving goal-problems to plan and carry out actions for 
a robot within this world, and then talking about what It Is 
doing and what the scene looks like. We want the robot to 
discuss Its plans and actions as well as carry them out. We can 
ask questions not only about physical happenings, but also about 
the robot's goals. We can ask "Why did you clear off that 
block?" or "How did you do It?". This means that the model 
Includes not only the properties of blocks, hands, and tables, 
but a model of the robot mind as well. We have written a 
collection of PLANNER theorems and data called BLOCKS, 
describing the world of toy blocks seen and manipulated by the 
robot, and the knowledge It needs to work with that world, (see 
Section 3.i*) . Figure 1 shows a typical scene, 
0. Semantics 

To connect the syntactic form of the sentence to Its 
meaning, we need a semantic system which provides primitive 
operations relevant to semantic analysis. This Includes a 
language In which we can easily express the meanings of words 
and syntactic constructions. The system Includes mechanisms 
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for setting up simple types of semantic networks end using 
deductions from them as a first phase of semantic analysis. For 
example, the network could Include the Information that a 
"block" Is a physical object, while a "bloc" Is a political 
object, and the definition of th< word "support" could use this 
Information In choosing the correct meanings for the sentences: 

The red block supports the pyramid* and 

The red bloc supports Egypt, 

More Important, the meaning of a word or construction Is 

also defined In the form of a program to be Interpreted In a 

semantic language. It Is this procedural aspect of semantics 

which Is missing In most other theories, which limit themselves 

to a particular type of network or relational structure- The 

meaning selected for a word can depend on any aspect of the 

sentence, the discourse, or the world. In deciding on the 

meaning of "one" In "Pick up the green one", we need a program 

which can examine past sentences. This program Is Included as 

part of the definition of the word "one". 

The semantic system Includes a powerful heuristic program 
for resolving ambiguities and determining the meaning of 
references In discourse. In almost every sentence, reference Is 
made either explicitly (as with pronouns) or Implicitly (as with 
the word "too") to objects and concepts not explicitly mentioned 
In that sentence. To interpret these, the program must have at 
Its disposal not only a detailed grammatical analysis (to check 
for such things as parallel constructions), but also a powerful 
deductive capacity (to see which reference assignments are 
logically plausible), and a thorough knowledge of the subject It 
Is discussing (to see which Interpretations are reasonable In 
the current si tuat Ion) , 
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In order to deal with language In a human way/ we must take 
into account all sorts of discourse knowledge* In addition to 
remembering the Immediate! y previous sentences for such things 
as pronoun references, the system must remember what things have 
been mentioned throughout the discussion, so that a reference to 
"the pyramid" will mean "the pyramid we mentioned earlier" even 
If there are several pyramids In the scene. 

In addition the system must have some knowledge of the way 
a person will communicate with It. If we ask "Is there a block 
on a green table?" "What color Is It?", the word "It" refers to 
the block. But If we had asked "Is there a green block on a 
table?" "What color Is It?", "It" must refer to the table since 
we would not ask a question which we had answered ourselves In 
the previous sentence. 

Our semantic system works with a base of knowledge about 
simple semantic features In the subject domain, and with a 
collection of definitions for Individual words. These 
definitions are written In a "semantics language" which allows 
simple words to be defined In a straightforward way, while 
allowing more complex words to call on arbitrary amounts of 
computation to Integrate their meaning Into the sentence. 

Finally we need a generative language capacity to produce 
answers to questions and to ask questions when necessary to 
resolve ambiguities. Grammatically this Is much less demanding 
than the Interpretive capacl ty, s I nee humans can be expected to 
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understand a wide range of responses/ and It Is possible to 
express almost anything In a syntactically simple way. However, 
It takes a sophisticated semantic and deductive capability to 
phrase things In a way which Is meaningful and natural In 
discourse, since the form of a response depends on both the 
context and on what the speaker assumes that the hearer knows 
and wants to know. 
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1.2 Implementation of the System 
The language understanding program Is written In LISP to 
run under the PDP-10 Incompatible Time-sharing System at the 
Artificial Intelligence Laboratory at MIT. When operating with 
a 200 word vocabulary and a fairly complex scene. It occupies 
approximately 30K of core. This Includes the LISP Interpreter, 
al 1 of the programs, dictionary entries, and data, and enough 
free storage to remember a sequence of actions and to handle 
complex sentences and deductions. See Figure 3 for a more 
detailed descrlpton of memory usage. 

The program Is organized as Indicated In Figure 2, (Arrows 
Indicate that one part of the program calls another directly): 



MONITOR 

/ 

INPUT . GRAMMAR 




DICTIONARY*^ PROGRAMMAR SEMANTIC 

FEATURES 



PLANNER DATA 

Figure 2 -- Organization of the Programs 



1. MONITOR Is a small LISP program which calls the basic 
parts of the system. Since the system Is organized vertlcaly, 
most of the communication between components Is done directly, 
and the monitor Is called only at the beginning and end of the 
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Parser 



Semantics 



Deduction Other 



Interpreters 


PROGRAMMAR 
5.8 




PLANNER 
5.5 


LISP 
and 
Display 
U.8 




Knowledge 
of Engl 1 sh 
22.5 


GRAMMAR 
7.3 


SEMANTICS 
15.2 








Knowledge 
of Subject 
16. S 


DICTIONARY 
1.7 


DICTIONARY 
6.0 


BLOCKS 
8.8 






Data for 
Scene 

2.5 






Assertions 
1.3 


Display 
1.2 




Total 

G7.6 


U.8 


21.2 


15.6 


16.0 





Storage Allocation for Language Understanding Program 
In Thousands of POP-10 words 

Note! Approximately 12 thousand additional words of free 
storage are necessary for a dialog like the one described In 
Section 1.3. As the length of dialog or complexity of the 
actions Is Increased, more free storage Is needed. 



Figure 1 — Memory Requirements 
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understanding process , 

2. INPUT Is a LISP program which accepts typed Input In 
noriral English orthography and punctuation, looks up words In 
the dictionary, performs morphemic analysis (e.g, realizing that 
"running" is the "Ing" form of the word "run", and modifying the 
dictionary definition accordingly), and returns a string of 
words, together with their definitions. This Is the Input with 
which the grammar works* 

3. The GRAMMAR Is the main coordinator of the language 
understanding process. It consists of a few large programs 
written In PROGRAMMAR to handle the basic units of the English 
language (such as clauses, noun groups, prepos I tonal groups, 
etc.). There are two PROGRAMMAR compilers, one which compiles 
Into LISP, which Is run Interpretlvely for easy debugging, and 
another which makes use of the LISP compiler to produce LAP 
assembly code for efficiency. 

b. SEMANTICS Is a collection of LISP programs which work In 
coordination with the GRAMf4AR to Interpret sentences. In 
general there are a few semantics programs corresponding to each 
basic unit In the grammar, each performing one phase of the 
analysis for that unit. These semantics programs call PLANNER 
to make use of deduction In Interpreting sentences, 

5. ANSWER Is another collection of LISP programs which 
control the responses of the system, and take care of 
remembering the d Iscourse for future reference. 1 1 contal ns a 
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number of heuristic programs for producing answers which take 
the discourse Into account, both In deciding on an answer and In 
figuring out how to express It In fluent English. 

6. PROGRAMMAR Is a parsing system which Interprets grammars 
written In the form of programs. It has mechanisms for building 
a parsing tree, and a number of special functions for exploring 
and manipulating this tree in the GRAMMAR programs. It Is 
written in LISP. 

7. The DICTIONARY actually consists of two parts. The first 
Is a set of syntactic features associated with each word, used 
by the GRAMMAR. The second Is a semantic definition for each 
word, written In a language which Is Interpreted by the 
SEMANTICS programs. The form of a word's definition depends on 
Its word class <e*g. the definition of "two" Is "2") . There are 
special facilities for Irregular forms (like "geese" or 
"slept 11 ), and only the definitions of root words are kept/ since 
INPUT can analyze a variety of endings. The definitions are 
actually kept on the LISP property list of the word, and 
dictionary lookup Is handled automatically by LISP. 

8. The system has a network of SEMANTIC FEATURES, kept on 
property lists and used for an initial phase of semantic 
analysis. The features subdivide the world of objects and 
actions Into simple categories, and the semantic Interpreter 
uses these categories to make some of Its choices between 
alternative definitions for a word (for example "In 11 would mean 
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"contained In" when applied to a hollow object like a box, but 
would mean "part of 11 when applied to a conglomerate like a 
stack, as In "the red block in the tall stack"). 

9. BLOCKS Is a collection of PLANNER theorems which contain 
the system's knowledge about the properties of the physical 
world, and the methods of achieving goals and making deductions 
about It. It Is a very simple model, designed less for a 
realistic simulation of a robot, and more to give the system a 
world to talk about In which one can say many different types of 
things. 

10. MOVER Is a set of display routines written In LAP which 
simulate the robot's world on the DEC 3U0 graphic display 
attatched to the POP- 10 computer. It displays the scene 
Indicated by the DATA currently In memory, and any action called 
for Is displayed In real time, so the human can get visual 
feedback from his commands to the robot. The Input sentences 
and responses are also displayed. Figure 1 displays a typical 
scene, 

11. PLANNER Is the deductive system used by the program, and 
Is used at alt stages of the analysis, both to direct the 
parsing process and to deduce facts about the BLOCKS world. 
Currently the system uses Micro-Planner, an abbreviated version 
of Carl Hewitt's original language. Implemented by Terry 
Winograd, Gerald Sussman, and Eugene Charnlak <Sussman>. It Is 
wrl tten In LISP, 
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12. The DATA about the current scene Is In the form of 
PLANNER assertions describing the objects, their sizes, shapes, 
colors, and locations. The system assumes It has some sort of 
robot "eye", so all of this Information Is Initially known and 
does not have to be entered In English. 

When running compiled, the system is fast enough to carry 
on a real-time discourse. Each sentence takes from 5 to 20 
seconds to analyze and respond to, and the display Is designed 
to move at the speed of a real arm. 

The program listings are all available In hard copy and 
DECtape, but before Implementing them on another system. It 
would be wise to read about the special features of MACLISP 
<WhIte> which may be In conflict with other LISP 
Implementations. 
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1.3 Sample Dialog 
This section contains a sample of a dialog with the 
language understanding system. It carries out actions and 
answers questions about a simple scene containing a table, hand, 
and box, and several blocks and pyramids. The I Ines In lower 
case were typed In by a person, and the lines In upper case are 
the computer 1 s response. Indented 1 Ines are comments describing 
some of the features being exhibited. 

The dialog was carried out In real time with the response 
of the "robot" displayed on a CRT screen, figures are Included 
to show the state of the scene at various points. 

1. pick up a big red block. 

OK. 

The system answers "OK" when It carries out a 
command. In order to pick up the red block, 
It had to clear It off by finding a space for 
the green one and moving the green one away. 
Figure k shows the scene when the coiTfnand was 
given. Figure 5 shows the green block being 
put out of the way, and Figure 6 shows the end 
resul t. 

2. grasp the pyramid, 

I DON'T UNDERSTAND WHICH PYRAMID YOU MEAN. 

There are three pyramids In the scene, so the 
program does nothing. 

3. find a block which Is taller than the one you arc holding 
and put It into the box. 

The system has to figure out what Is meant by 
"one" and "It", as well as using deduction to 
decide which block It Is holding. 
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Figure U 
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BY "IT", I ASSUME YOU MEAN THE BLOCK WHICH IS TALLER THAN THE 
ONE I AM HOLDING. 



OK. 



Printed out to keep us Informed. 

The hand has to set down the red block to pick 
up the taller blue one. Figure 7 shows the 
resul t . 



t», what does the box contain? 

THE BLUE PYRAMID AND THE BLUE BLOCK. 
5. what Is the pyramid supported by? 



Earlier the system responded to "the pyramid" 
by saying "I don't understand which pyramid 
you mean." now. It knows that we must mean 
the one which was just mentioned. 



THE BOX. 
6. how many blocks are not In the box? 



Negatives such as "not" and "no" are handled 
whether they appear In clauses or noun groups 
(such as "no boxes 11 ) * 



FOUR OF THEM. 



7. Is at least one of them narrower than the one which I told 
you to pick up? 

The phrase "at least one of them" must be 
analyzed to r^fer back to the previous 
question, and "the one which..." must also 
refer to blocks. Noun groups can have 
compl icated numbers such as "at least 
one 1 ', "more than three", "exact 1 y seven", etc 
The memory of past events Is used to deduce 
which block Is "the one which I told you to 
pick up". Notice the complicated grammatical 
structure In which "the one" Is actually the 
object of a clause embedded two levels deeper 

YES, THE RED CUBE, 

8. Is It supported? 
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Passives can be used without a "by ..." 
phrase. 



YES, BY THE TABLE. 
9. can the table pick up blocks? 
NO. 



The semantic rules make this a nonsens leal 
question which therefore must be false. 



10, can a pyramid be supported by a block? 
YES. 



The deductive system finds an actual example, 
so It knows this Is possible. 



11. can a pyramid support a pyramid? 
I DON'T KNOW. 

The question makes sense, but there Is no 
example In the scene, so there Is no easy way 
to tell without being able to examine the 
robot 1 s programs * 

12. stack up two pyramids. 

I CAN'T. 

The robot tries and fa I Is. 

13. the blue pyramid Is nice. 

I UNDERSTAND. 

This data Is added to the data base. 

ll». I like blocks which are not red, but I don't like anything 
which supports a pyramid. 



I UNDERSTAND. 



Two new theorems are created for proving 
things about "1 Iking", 



15. do I 1 Ike the box? 
NO, 



The box supports whatever It contains, 
including a pyramid. 
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16. do I like anything In the box? 

YES, TWO THINGS: THE BLUE BLOCK AND THE BLUE PYRAMID. 

Presumably we like anything which Is nice, and 
the blue block Is certainly not red. 

17. will you please stack up both of the red blocks and either 
a green cube or a pyramid? 

OK. 

Logical connectives such as "and" "or" 
"el the r" etc. are handl ed In both the grammar 
and semantics. A command Is executed even If 
It looks like a question. The robot carries 
out the conmand by the series of steps shown 
In figures 8 through 11. It puts a green cube 
on the large r^d block tnote that It chooses 
the green cube with nothing on It), then 
removes the smal 1 pyramid from the little red 
cube, so that It can use that cube to complete 
the stack as specified. 

18. which cube Is sitting on the table? 

THE LARGE GREEN ONE WHICH SUPPORTS THE RED PYRAMID. 

Ordinarily the system would be satisfied with 
calling this M a big green cube' 1 / but since we 
asked "which cube" It has to be more specific 
In naming It. The use of substitute nouns 
(such as "one") and pronouns In answers makes 
the discourse more natural. 

19. Is there a large block behind a pyramid? 

YES, THREE OF THEM: A LARGE RED ONE, A LARGE GREEN CUBE AND 
THE BLUE ONE. 

20* put a small one onto the green cube which supports a 
pyramid. 

"A small one" must contrast with the earlier 
phrase "a large block", so this means "a small 
block". 

OK. 

21* put the littlest pyramid on top of It, 
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Words like "littlest" are not In the 
dictionary but are Interpreted from the root 
forms like "little". The scene now looks like 
Figure 12. 

OK. 

22. does the shortest thing the tallest pyramid 1 s support 
supports support anything green? 

The grammar can handle words like "support" 
which arB both verbs and nouns without getting 
confused, and can understand possesslves. To 
compute the answer, see Figure 13 for the 
scene. 

YES, THE GREEN PYRAMID. 

23, what color Is 1 1? 

BY "IT", I ASSUME YOU MEAN THE SHORTEST THING THE TALLEST 
PYRAMID'S SUPPORT SUPPORTS. 

If we had asked "what size Is It?", the system 
would have assumed that "It" meant the green 
pyramid, but nobody would ask the color of 
something which was just called "the green 
pyramid", so It had to look farther for the 
reference of "It". 

RED 

2**. how many things are on top of green cubes? 

I'M NOT SURE WHAT YOU MEAN BY "ON TOP OF" IN THE PHRASE "ON 
TOP OF GREEN CUBES ". 
00 YOU MEAN: 

1 - DIRECTLY ON THE SURFACE 

2 - ANWHERE ON TOP OF ? 

When there are several Interpretations and 
none Is clearly better, the system has to ask 
the user for more Information. The 
paraphrases were Included with the dictionary 
definition of "on", and the reouest for more 
Information occurs automatically when the 
heuristics for resolving ambiguities aren't 
sufficient. A request like this can also 
occur when the meaning of a pronoun such as 
"It" or "they M Is unclear. 
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2 

THREE OF THEM. 

25. had you touched any pyramid before you put the green one on 
the I Ittle cube? 

YES, THE GREEN ONE. 

The system deduces from Its memory what event 
Is being referred to, and can relate other 
events to It Tn time. The analysis Includes 
complex tenses of Verb groups, and there are 
special facilities for keeping crack of the 
various forms of Irregular verbs like "have". 

26. when did you pick It up? 

WHILE I WAS STACKING UP THE RED CUBE, A LARGE RED BLOCK AND A 
LARGE GREEN CUBE. 

A time Is described by naming the main goal 
which was being worked on. 

27. why? 

The system remembers immediate! y previous 
sentences to understand questions like this 
and the following ones. 



TO GET RID QF IT. 



By keeping track of selected parts of the 
original subgoal tree, the system has some 
understanding of Its own motives. 

26. why did you do that? 

TO CLEAN OFF THE RED CUBE. 

"That" refers to the action mentioned In the 
answer. 

29. why did you clear off that cube? 

Here, "that" means the one just referred to. 

TO PUT IT ON A LARGE GREEN CUBE. 
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30. why did you do that? 

TO STACK UP THE RED CUBE , A LARGE RED BLOCK AND A LARGE GREEN 
CUBE. 

51. why did you do that? 

BECAUSE YOU ASKED ME TO. 

Jt remembers what It did, not how the request 
was worded, 

32, how did you do 1 t? 

"It* 1 refers to the action mentioned In the 
last question, rather than the answer. 

BY PUTTING A LARGE RED BLOCK ON THE TABLE ; THEN LETTING GO 
OF IT ; THEN PUTTING A LARGE GREEN CUBE ON IT ; THEN LETTING GO 
OF THAT CUBE ; THEN PUTTING THE RED CUBE ON THAT CUBE ; THEN 
LETTING GO OF THAT CUBE. 

This Is a description of the top- level 
subgoals used to achieve the goal. 

33. how many objects did you touch while you were dolnc It? 

Time references can refer to a time span, 
using words 1 1 ke "whl le'\ 



FOUR OF THEM. 

3U. what did the red cube support before you started to clean 
It off? 

THE GREEN PYRAMID. 

By remembering what has been moved. It Is 
possible to deduce things about earlier states 
of the scene, VJe can use complex nestings of 
clauses like "before you started to clean It 
off". 

35. there were five blocks to the left of the box then, 

"Then" must refer to the time mentioned In the 
last sentence, 

NO, ONLY FOUR OF THEM: THE RED CUBE , TWO LARGE GREEN CUBES 
AND A LARGE RED BLOCK, 
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Earlier In the dialog. Information about 
"liking" was accepted at face value. But the 
system knows that It has complete Information 
about about the basic properties of objects. 
Including location. It checks any statements 
about location or other such properties to see 
whether it agrees or disagrees. 

36. put the blue pyramid on the block In the box. 

This Is syntactically airblguous, but Is 
understood without ambiguity since the parsing 
corresponding to "put the blue pyramid which 
Is on the block In the box." does not make 
sense In the current scene, whl le "put the 
blue pyramid on the block which Is In the box" 
does. We can see the sltutation In Figure Ik, 
and the result In Figure 15. 

OK. 

37. Is there anything which Is bigger than every pyramid but Is 
not as wide as the thing that supports It? 

The system understands quantifiers such as 
"every", "any", "some", etc., and structures 
such as "bigger than", "as wide as", etc. 
Here "It" refers to a noun group within the 
same sentence but two levels up In the pars Ing 
tree. 

YES, THE BLUE BLOCK. 

39. does a steeple 

SORRY, I DON'T KNOW THE WORD "STEEPLE". PLEASE TYPE TWO SPACES. 

The system Interrupts Immediately when It sees 
a word It can't figure out. The two spaces 
are to clear the Input buffer of anything 
typed after the message was sent but before 
the typl st saw I t . 

*i0. a "steeple" Is a stack which contains two green cubes and a 
pyramid. 

The quote marks Indicate that we are defining 
a new word. 
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I UNDERSTAND, 
41. are there any steeples now? 
NO. 



The new definition Is used as a description to 
recognize steeples. 



1*2. bui Id one. 

OK. 

The description given In the definition Is 
enough to deduce a plan for building new 
objects as well as recognizing them. The word 
"one 11 means M a steeple", and the plan Is 
carried out. First, one of the large green 
cubes Is cleared off by removing the red cube 
and two pyramids. Then the two green cubes 
and the tall red pyramid are stacked up to fit 
the descr Ipt Ion. Figures 26 through 19 show 
the Initial state, some Intermediate steps, 

and the resul t . 
U3. call the biggest block "superblock". 

OK. 

We can also give objects names. 

kU, have you picked up superblock since we began? 

YES. 

1*5. why did you drop It? 

BY "IT", I ASSUME YOU MEAN THE SUPERBLOCK. 

I CAN'T EXPLAIN A NON-EXISTENT EVENT. 

There Is no record In memory of any block 
being dropped. 

46. Is there anything to the right of the red pyramid? 

YES, FOUR THINGS: THE BOX , THE REO CUBE, THE BLUE PYRAMID 
AND THE BLOCK NAMED SUPERBLOCK 

Once an object has a name, the name Is used to 
help identify It In responses. 
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U7. thank you. 
YOU'RE WELCOME! 



The special Id lom-def Inlng capabilities allow 
us to Include such pleasantries without 
modifying the grammar. 
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Chapter 2. Syntax 

2.1 Basic Approach to Syntax 
1*1*1 Syntax £r_d Meaning 

The decision to consider syntax as a proper study devoid of 
semantics Is a basic tenet of most current linguistic theories. 
Language Is viewed as a way of organizing strings of abstract 
symbols, and competence Is explained In terms of symbol- 
manipulating rules* At one level this has been remarkably 
successful. Rules have been formulated which describe In great 
detail how most sentences are put together. The problem comes 
In trying to relate those symbols and structures to the function 
of language as a vehicle for conveying meaning. The same 
approach which has worked so well In accounting for the 
machinations of syntax has been unable to provide any but the 
most rudimentary end unsatisfactory accounts of semantics. 

The problem Is not that current theories are finding wrong 
answers to the questions they ask; It Is that they are asking 
the wrong questions. What Is needed Is an approach which can 
deal meaningfully with the question "How Is language organized 
to convey meaning?" rather than "How are syntactic structures 
organized when viewed In Isolation?". 

How does a sentence convey meaning beyond the meanings of 

Individual words? Here Is the place for syntax. The structure 

of a sentence can be viewed as the result of a series of 

grammatical choices made In generating It. The speaker encodes 

meaning by choosing to build the sentence with certain 

"features", and the problem of the hearer is to recognize the 
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presence of those features and Interpret their meaning. 

We want to analyze the possible choices of features and 
functions which grammatical structures can have. For example, 
we might note that all sentences must be either IMPERATIVE, 
DECLARATIVE, or a QUESTION, and that In the last case they must 
choose as well between being a YES-NO question or a WH- question 
containing a word such as "why" or "which". We can study the 
way In which these features of sentences are organized « which 
ones form mutually exclusive sets (called "systems"), and which 
sets depend on the presence of other features (like the set 
containing YES-NO and WH- depends on the presence of QUESTION), 
This can be done not only for full sentences, but for smaller 
syntactic units such as noun groups and prepositional groups, or 
even for individual words . 

In addition we can study the different functions a 
syntactic "unit" can have as a part of a larger unit. In 
"Nobody wants to be alone.", the clause "to be alone" has the 
function of OBJECT In the sentence, while the noun group 
"nobody" Is the SUBJECT. We can note that a transitive clause 
must have units to fill the functions of SUBJECT and OBJECT, or 
that a WH- question has to have some constituent which has the 
role of "question element" (like "why" In "Why did he go?" or 
"which dog" In "Which dog stole the show?"). 

In most current theories, these features and functions are 
implicit In the syntactic rules. There Is no explicit mention 
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of them, but the rules are designed In such a way that every 
sentence will In fact be one of the three types listed above, 
and every WH- question will In fact have a question element. 
The difficulty Is that there Is no attempt In the gran-mar to 
distinguish significant features such as these from the Infinite 
number of other features we could note about a sentence, and 
which are also Implied by the rules. 

If we look at the "deep structure' 1 of a sentence, again the 
features and functions are Implicit, The fact that It Is a YES- 
NO question Is Indicated by a question marker hanging from a 
particular place In the tree, and the fact that a component Is 
the object or subject Is determined from Its exact relation to 
the branches around It, The problem Isn't that there Is no way 
to find these features In a parsing, but that most theories 
don't bother to ask "Which features of a syntactic structure are 
Important to conveying meaning, and which are just a by-product 
of the symbol manipulations needed to produce the right word 
order." 

What we would like Is a theory In which these choices of 
features are primary. Professor M.A,K, Halllday at the 
University of London has been working on such a theory, called 
Systemic Grammar (see references <HalIIday 1961, 1966a, 1966b, 
1967> <Hudd1eston>, <Hudson>). His theory recognizes that 
meaning Is of prime Importance to the way language Is 
structured. Instead of having a "deep structure" which looks 
like a kind of syntactic structure tree, he deals with "system 
networks" which describe the way different features Interact and 
depend on each other. The primary emphasis Is on analyzing the 
limited and highly structured sets of choices which are made In 
producing a sentence or constituent. The exact way In which 
these choices are "realized" In the final form Is a necessary 
but secondary part of the theory. 
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The realization rules carry out the work which would be 
done by transformations In transformational grammar (TG) . In 
TG, the sentences "Sally saw the squirrel. ", "The squirrel was 
seen by Sally. ", and "Did Sally see the squirrel?" would be 
derived from almost Identical deep structures, and the 
difference In final form Is produced by transformations. In 
systemic grammar, these would be analyzed as having most of 
their features In contnon, but differing In one particular 
choice, such as PASSIVE vs. ACTIVE, or DECLARATIVE vs, QUESTION. 
The realization rules would then describe the exact word order 
used to signal these features. 

What does this theory give us to use In a language 
understanding program? What kinds of parsings does It produce? 
If we look at a typical parsing by a systemic grarrmar, we note 
several points. First, It Is very close to the surface 
structure of the sentence. There Is no rearrangement Into 
supposed "underlying" forms. Instead, each constituent Is 
marked with features Indicating Its structure and function. 
Instead of saying that "Did John go?" has an underlying 
structure which looks like "John went.", we simply note that tt 
has the features QUESTION and YES-MO, and that the noun eroup 
"John" has the function SUBJECT. Other parts of the language 
understanding process do not have to be concerned with the exact 
way the parsing tree Is structured, since they can deal directly 
with the relevant features and functions. 
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What Is more Important Is that these features are not 
random unrelated lists of observations. They are part of a 
highly structured network, and the grammatical theory Includes a 
description of that network. When we do semantic analysis, we 
are not faced with the task of Inventing "projection rules" to 
deal with the raw form of specific syntactic rules. Instead we 
can ask "What aspect of meaning does this system convey?"/ and 
"What Is the significance of this particular feature within Its 
system?". 
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2-1-2 Posing 

In Implementing a systemic grammar for a computer program 
for understanding language, we are concerned with the process of 
recognition rather than that of generation. We do not begin 
with choices of features and try to produce a sentence. Instead 
we are faced with a string of letters, and the job Is to 
recognize the patterns and features In It. We need the Inverse 
of realization rules -- Interpretation rules which look at a 
pattern. Identify Its structure, and recognize Its relevant 
features* This Interpretation process Is closely related to 
other types of pattern recognition, and many Interesting 
parallels can be drawn with the process of Interpreting, a visual 
scene (see <Wlnograd 1969>), The Important aspect of both types 
of Interpretation Is looking for symbolic features which will be 
relevant to understanding, so that the parsing can be Integrated 
with the rest of the understanding process. In general, this 
problem of Isolating Important features from complex Information 
and representing them symbolically Is a central Issue for 
Artificial Intelligence, and the Idea of a "systemic" parser may 
be of use In other areas. 

The parsing system for our program Is actually an 
Interpreter for PROGRAMMAR, a language for writing grammars. It 
Is basically a top-down left- to-right parser, but It modifies 
these properties when It Is advantageous to do so. By writing 
In a language designed for grammars, we can express the 
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regularities of language In a straightforward way, as simply as 

In a syntax-directed parser. The primitives of the language are 

those operations which have particular relevance to natural 

language and Its parsing, 

A program for parsing language Is as much a "generative 11 

description of the language as Is a set of rules for producing 

sentences. The meaning of "generative" In Chomsky's original 

sense (<Chomsky 1957, 19E5>) Is that the grammar should 

associate a structural description to each permissible sentence 

In the language. A parsing program does just that. 

By making the formalism for grammars a programming 
language, we enable the grammar to use special tools to handle 
complex constructions and Irregular forms. For example, we can 
set up programs to define certain words like "and", and "or" 
as "demons", which cause an Interrupt In the parsing process 
whenever they are encountered In the normal lef t-to-r Ight order. 
In order to run a special program for conjoined structures. 
Idioms can also be handled using this "Interrupt" concept. In 
fact, the process can be Interrupted at any point In the 
sentence, and any other computations (either semantic or 
syntactic) can be performed before going on. These may 
themselves do bits of parsing, or they may change the course the 
basic program will take after they are done. 

It Is paradoxical that linguistic workers familiar with 
computers have generally not appreciated the Importance of the 
"control" aspect of programming, and have not used the process- 
describing potentialities of programming for their parsing 
theories. They have Instead restricted themselves to the 
narrowest kinds of rules and transformations -- as though a 
programmer were to stick to such simple models as Turing 
machines or Post productions. Designers of computer languages 
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today show this same tendency! See Mlnsky's remark In his 
Turing lecture <Mlnsky 1970>. Our parser uses semantic guidance 
at all points, looking for a meaningful parsing of the sentence 
rather than trying all of the syntactic possibilities. Section 
2.2 describes PROGRAMMAR In detail, and 2,3 gives a sample 
grarrenar for English, Section 2.1* explains programming details, 
and shows how the special features of the language are actually 
used to handle specific linguistic problems. 
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2.2 A Description of PROGRAMMER 

2-2.1 GrapFar ani computers 

In order to explain the features of PROGRAMMAR, we will 
summarize some of the principles of grammar used In computer 
language processing. The basic form of most grammars Is a list 
(ordered or unordered) of "replacement rules, 11 which represent a 
processs of sentence generation. Each rule states that a 
certain string of symbols (Its left side) can be replaced by a 
different set of symbols (Its right side). These symbols 
Include both the actual symbols of the language (called terminal 
symbols) and add 1 1 lonal "non-terminal" symbols. One non- 
terminal symbol Is designated as a starting symbol, and a string 
of terminal symbols Is a sentence If and only If It can be 
derived from the start I ng symbol through success I ve appl Icat Ion 
of the rules* For example we can write Rrarrmar 1: 



1.1 S -> NP VP 

1.2 NP -> DETERMINER NOUN 

1.3 VP -> VERB/INTRANSITIVE 
1.1* VP -> VERB/TRANSITIVE NP 

1.5 DETERMINER -> the 

1.6 noun -> giraffe 

1.7 NOUN -> apple 

1.8 VERB/INTRANSITIVE -> dreams 

1.9 VERB/TRANSITIVE -> eats 

Figure 20 — GRAMMAR 1 



By starting with S and applying the list of rules (1.1 1.2 
1.5 1.6 1*1* 1.2 1.7 1.5 1.9), we get the sentence "The giraffe 
eats the apple." Several things are noteworthy here. This Is an 
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unordered set of rules. Each rule can be applied any number of 
times at any point in the derivation where the symbol appears. 
In addition, each rule Is optional. We could just as well have 
reversed the applications of 1.6 and 1,7 to get "The apple eats 
the giraffe,", or have used 1.3 and 1.8 to get "The giraffe 
dreams." This type of derivation can be represented graphically 
as; 



DETERMINER 




NOUN 



t te 



VP 

/ 
VERB/TRANSITIVE 



NP- 



air ffe eats 

Figure 21 — Parsing Tree 



I 



N0U« 

I, 

apple 



We will call this the parsing tree for the sentence, and 
use the usual terminology for trees (node, subtreee, daughter, 
parent, etc.). In addition we will use the linguistic terms 
"phrase" and "constituent" Interchangeably to refer to a 
subtree. This tree represents the "Immediate constituent" 
structure of the sentence. The PROGRAMMAR language Is a general 
parsing system which, although oriented ttoward systemic 
grammar, can be used to parse grammars based on other theories. 
In descrblng PROGRAMMAR we have used a more conventional set of 
notations and analysis of English In order to make the 
description Independent of the work presented In later sections. 
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2.2.7 Context-free £n£ Context-sensitive Gramnars 

Grammar 1 Is an example of what Is called a context-free 
grammar. The left side of each rule consists of a single 
symbol, and the Indicated replacement can occur whenever that 
symbol \s encountered. There are a great number of dl f f erent 
forms of grammar which can be shown to be equivalent to this 
one, In that they can characterize the same languages. It has 
been pointed out that they are not theoretically capable of 
expressing the rules of English, to produce such sentences as, 
"John, Sidney, and Chan ordered an eggroll, a ham sandwich, and 
a bagel respect 1 vet y." Much more important, even though they 
could theoretically handle the bulk of the English language, 
they cannot do this at all efficiently. Consider the simple 
problem of subject-verb agreement. We would like a grammar 
wh Ich generates "The g! raf fe dreams , " and "The gl raf f es dream. ", 
but not "The giraffe dream." or "The giraffes dreams.". In a 
context-free grammar, we can do this by Introducing two starting 
symbols, S/PL and S/SG for plural and singular respectively, 
then duplicating each rule to match. For example, we would 
have: 

1.1.1 S/PL -> NG/PL VP/PL 

1.1.2 S/SG -> NG/SG VP/SG 

1.2.1 NG/PL -> DETERMINER NOUN/PL 

1.2.2 NG/SG -> DETERMINER NOUH/SG 



1.6.1 NOUN/PL -> giraffes 

1.6.2 NOUN/SG -> giraffe 

etc. 
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If we then wish to handle the difference between "I am", 
"he Is 11 , etc. we must Introduce an entire new set of symbols 
for first-person. This sort of duplication propaeates 
mul tlpl Icatlvel y through the grammar/ and arises In all sorts of 
cases. For example, a question and the corresponding statement 
will have much In common concerning their subjects, objects 
verbs, etc., but In a context-free grammar, thfey will Ih general 
be expanded through two entirely different sets of symbols. 

One way to avoid this problem Is to use context-sensitive 
rules. In these, the left side may Include several symbols, and 
the replacement occurs when that combination of symbols occurs 
In the string being generated. 
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1*1.} Systemic Granrar 

We can add power to our grammar with context-sens I tve rules 
which, for example. In expanding the symbol VERB/ INTRANS ITI VF, 
look to the preceding symbol to decide whether It Is singular or 
plural. By using such context-sensitive rules, we can 
characterize any language whose sentences can be listed by a 
determlnlst Ic (posslbl y neve rending) process. (I.e. they have 
the power of a turlng machine). There Is however a problem In 
Implementing these rules* In any but the simplest cases, the 
context will not be as obvious as In the simple example given. 
The choice of replacements will not depend on a single word, but 
may depend In a complex way on the entire structure of the 
sentence. Such dependencies cannot be expressed In our simple 
rule format, and new types of rules must be developed. 
Transformational grammar solves this by breaking the generation 
process down Into the context-free base grammar which produces 
"deep structure" and a set of transformations which then operate 
on this structure to produce the actual "surface structure" of 
the grammatical sentence. We will not go Into the details of 
transformational grammar, but one basic Idea Is this separation 
of the complex aspects of language Into a separate 
trans format lonal phase of the gene rat Ion process. 

Systemic grammar Introduces context In a more unified way 
Into the Immediate-const I tuent generation rules. This Is done 
by Introducing "features" associated with constituents at every 
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level of the parsing tree. A rule of the grammar may depend, 

for example, on whether a particular clause Is transitive or 

intransitive. In the examples "Fred found a frog.", "A frog was 

found by Fred,", and "What did Fred find?", all are transitive, 

but the outward forms are quite different. A context-sensitive 

rule which checked for this feature directly In the string being 

generated would have to be quite complex. Instead, we can allow 

each symbol to have additional subscripts, or features which 

control its expansion. In a way., this Is like the separation of 

the symbol HP Into NP/PL and NP/SG In our augmented context-free 

grammar. But It Is not necessary to develop whole new sets of 

symbols with d set of expansions for each. A symbol such as 

CLAUSE may be associated with a whole set of features (such as 

TRANSITIVE, QUESTION, SUBJUNCTIVE, OBJECT-QUESTION, etc.) but 

there is a single set of rules for expanding CLAUSE. These 

rules may at various points depend on the set of features 

present. 

The power of systemic grammar rests on the observation that 
the context-dependency of natural language Is centered around 
clearly defined and highly structured sets of features, so 
through their use a great deal of complexity can be handled wery 
economically. More Important for our purposes, there Is a high 
correlation between these features and the semantic 
Interpretation of the constituents which exhibit them. They 
cannot be put In a one-to-one correspondence with semantic 
properties of the phrases In which they appear, but are a 
tremendous aid to interpretation. 

A parsing of a sentence In a systemic grammar might look 

very much like a context-free parsing tree, except that to each 

node would be attached a number of features. These features are 
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not random combinations of facts about the constituent, but are 
a part of a carefully worked out analysis of a language In terms 
of Its "systems". The features are organized In a network, with 
clearly organized dependencies. For example, the features 
IMPERATIVE (command) and QUESTION are mutually exclusive In a 
clause, as are the features YES-NO {yes-no question like "Did he 
go?") and WH- question (like "Who went?). In addition, the 
second choice can be made only If the choice QUESTION was made 
In the first set. A set of mutually exclusive features Is 
called a "system", and the set of other features which must be 
present for the choice to be possible Is called the "entry 
condition" for that system. This Is discussed In detail In 
section 2.3. 

Another basic concept of systemic grammar Is that of the 
rank of a constituent. Rather than having a plethora of 
different non-terminal symbols, each expanding a constituent In 
a slightly different way, there are only a few basic "units", 
each having the possibility of a number of different features, 
chosen from the "system network 1 * for that unit. In an analysis 
of English, three basic units seem to explain the structure; the 
CLAUSE, the GROUP, and the WORD. In general, clauses are made 
up of groups, and groups made up of words. However, through 
"rankshlft", clauses or groups can serve as constituents of 
other clauses or groups. Thus, In the sentence "Sarah saw the 
student sawing logs." "the student sawing logs" Is a NOUN GROUP 
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with the CLAUSE "sawing logs" as a constituent (a modifier of 
"student"). 

The constituents "who", ''three days 11 , "some of the men on 
the board of directors," and "anyone who doesn't understand me" 
are all noun groups, exhibiting different features. This means 
that a PROGRAMS A R grammar will have only a few programs, one to 
deal with each of the basic units. Our current grammar of 
English has programs for the units CLAUSE, NOUN GROUP, VERB 
GROUP, PREPOSITION GROUP, and ADJECTIVE GROUP. 
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2 1 ? i ** Srappars fl£ Programs 

Earlier we pointed out that a complete generative 
description of a language can be In the form of a program for 
parsing I t « For simple grammars/ there Is a close 
correspondence between the parsing program and the usual 
generation rules. 

We can think of a grammar as a set of Instructions for 
parsing a sentence In the language. A rule like: NP -> 
DETERMINER NOUN can be Interpreted as the Instruction "If you 
want to find a NP, look for a DETERMINER followed by a NOUN. 11 
Grammar 1 could be diagrammed as shown In Figure 22. 

The basic function used Is PARSE, a function which tries to 
add a constituent of the specified type to the parsing tree. If 
the type has been defined as a PROGRAMMAR program, PARSE 
activates the program for that unit, giving it as Input the part 
of the sentence yet to be parsed and (optionally) a list of 
Initial features. If no definition exists, PARSE Interprets Its 
arguments as a list of features which must be found In the 
dictionary definition of the next word In the sentence. If so. 
It attaches a node for that word, and removes It from the 
remainder of the sentence. If not. It falls. If a PROGRAMMAR 
program has been called and succeeds, the new node Is attached 
to the parsing tree. If It falts, the tree Is left unchanged. 



DEFINE progrur: SENTFMTr 



PARSE a HP- 
suJLd? 
PARSE a VP- 



fnl 1? 



run? 



4 

wor 
>ft1 



any words, 
left? 



RETURN success 
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■^RETURN failure 



DEFINE program MP 



PARSE a DETERMINER. 
PARSE a NOUN 



A 

1 



RlTURH success 



^RETURN fal lure 



j 



DEFINE program VP 
PARSE a VERB 

Is It TRANSITIVE? 



t 



Is It INTRANSITIVE?- 
RETURN success-^ 



-^ RETURN failure 
fa 



PARSE a NP: 
I 



Figure 22 — Simple Parsing Program 
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2.2.5 The Form of PROGRAHMAR Grammars 

Written In PROGRAMMAR, the programs would look like: 



2.1 (PDEFINE SENTENCE 

2.2 (((PARSE NP) NIL FAIL) 

2.3 ((PARSE VP) FAIL FAIL RETURN))) 

2 . i* (PDEFINE NP 

2.5 (((PARSE DETERMINER) NIL FAIL) 

2.6 ((PARSE NOUN) RETURN FAIL))) 

2.7 (PDEFINE VP 

2.8 (((PARSE VERB) NIL FAIL) 

2.9 <(ISQ H TRANSITIVE) NIL INTRANS ) 

2.10 ((PARSE NP) RETURN NIL) 

2.11 INTRANS 

2.12 <(1SQ H INTRANSITIVE) RETURN FAIL))) 

Rules 1.6 to 1.9 would have the form: 

2.13 (DEFPROP GIRAFFE (NOUN) WORD) 

2,1* (DEFPROP DREAM (VERB INTRANSITIVE) WORD) 
etc. 

Figure 23 — Grammar 2 



This example Illustrates some of the basic features of 
PROGRAMMAR. First It Is embedded In LISP, and much of Its 
syntax Is LISP syntax. Units, such as 5ENTENCE are defined as 
PROGRAMMAR programs of no arguments. Each tries to parse the 
string of words left to be parsed In the sentence. The exact 
form of this Input string Is described In section 2.1*. 8. The 
value of (PARSE SENTENCE) will be a list structure corresponding 
to the parsing tree for the complete sentence. 

Each time a call Is made to the function PARSE, the system 
begins to build a new node on the tree. Since PROGRAMMAR 
programs can call each other recursively. It Is necessary to 
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keep a pushdown list of nodes which are not yet completed (i.e. 
the entire rlfihtmost branch of the tree). These are all called 
"active" nodes, and the one forred by the most recent call to 
PARSE Is called the "currently active node". 

We can examine our sample program to see the basic 
operation of the language. Whenever a PROGRAMMAR program Is 
called directly by the user, a node of the tree structure Is set 
up, and a set of special variables are bound (see section 
2.*i*9). The lines of the program are then executed In sequence, 
as In a LISP PROG, except when they have the special form of a 
BRANCH statement (a list whose first member (the CONDITION) Is 
non-atomic, and which has either 2 or 3 other members, called 
DIRECTIONS). Line 2,3 of GRAMMAR 2 Is a three-direction branch, 
and all the other executable lines of the program are two- 
direction branches. 

When a branch statement Is encountered, the condition Is 
evaluated, and branching depends on Its value. In a two- 
direction branch, the first direction Is taken If It evaluates 
to non-NIL, the second direction If It Is NIL. In a three- 
direction branch, the first direction Is taken only If the 
condition Is non-NIL, and there Is more of the sentence to he 
parsed. If no more of the sentence remains, and the condition 
evaluates non-NIL, the third direction Is taken. 

The directions can be of three types. First, there &re 
three reserved words, NIL, RETURN, and FAIL. A direction of NIL 
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sends evaluation Co the next statement In the program, FAIL 
causes the program to return NIL after restoring the sentence 
and the parsing tree to their state before that program was 
called. RETURN causes the program to attach the currently 
active node to the completed parsing tree and return the subtree 
below that node as Its value. 

If the direction Is any other atom. It acts as a GO 
statement, transferring evaluation to the statement Immediately 
following the occurence of that atom as a tag. For example. If 
a failure occurs In line 2.9, evaluation continues with line 
2.12. If the direction Is non-atomic, the result Is the same 
as a FAIL, but the direction Is put on a special failure message 
list, so the calling program can see the reason for failure. 
DIRECTIONS can also be used In the function GOCOND. The 
statment (GOCOND TAG1 TAG2) causes the program to go to TAG1 If 
there are words left to be parsed, and to TAG2 otherwise. 

Looking at the programs, we see that SENTENCE will succeed 
only If It first finds a NP, then finds a VP which uses up the 
rest of the sentence. In the program VP, we see that the first 
branch statement checks to see whether the next word Is a verb. 
If so. It removes It from the remaining sentence, and goes on. 
If not, VP falls. The second statement uses the PROGRAMMAR 
function ISQ, one of the functions used for checking features. 
OSQ A 8) checks to see whether the node or word pointed to by A 
has the feature B. H Is one of a number of special variables 
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used to hold Information associated with a node of the parsing 
tree, (see section 2.*. 9) It points to the last word or 
constituent parsed by that program. Thus the condition (ISQ H 
TRANSITIVE) succeeds only If the verb just found by PARSE has 
the feature TRANSITIVE. If so, the direction NIL sends It on to 
the next statement to look for a NP, and If It finds one It 
returns success. If either no such NP Is found or the verb Is 
not TRANSITIVE, control goes to the tag INTRANS, and If the verb 
Is INTRANSITIVE, the program VP succeeds. Note that a verb can 
have both the features INTRANSITIVE and TRANSITIVE, and the 
parsing will then depend on whether or not an object NP Is 
found. 
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2.2>6 Context-Sensitive Aspects 

So far, we have done little to go beyond a context-free 
grammar. How, for example, can we handle agreement? One way to 
do this would be for the VP program to look back In the sentence 
for the subject, and check Its agreement with the verb before 
going on. We need a way to climb around on the parsing tree, 
looking at Its structure. In PROGRAMMAR, this Is done with the 
pointer PT and the moving function *. 

Whenever the function * Is called. Its arguments form a 
list of Instructions for moving PT from Its present position. 
These Instructions can be quite general, saying things like 
"Move left until you find a unit with feature X, then up until 
you find a CLAUSE, then down to Its last const I cutent, and left 
until you find a unit meeting the arbitrary condition Y. M The 
Instruction list contains non-atomic CONDITIONS and atomic 
INSTRUCTIONS. The Instructions are taken In order, and v/hen a 
condition Is encountered, the preceding Instruction Is evaluated 
repeatedly until the condition Is satisfied. If the condition 
Is of the form (ATOM), It Is satisfied only If the node pointed 
to by PT has the feature ATOM, Any other condition Is evaluated 
by LISP, and Is satisfied If It returns a non-NIL value. 
Section 2-i». 10 lists the Instructions for *. 

For example, evaluating (• C U) will set the pointer to the 
parent of the currently active node. (The mnemonics are: 
Current, Up) The call (* C DLC PV (NP)) will start at the 
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current node, move down to the rightmost completed node (I.e. 
not currently active) then move left until It finds a node with 
the feature NP« (Down-Last-Completed, Previous). If * succeeds. 
It returns the new value of PT and leaves PT set to that value. 
If It falls at any point In the list, because the existing tree 
structure makes a command Impossible, or because a condition 
cannot be satisfied, PT is left at Us original position, and * 

returns NIL, 

We can now add another branch statement to the VP program 

In section 2.2.5 between lines 2.8 and 2.9 as follows: 

2.8.1 (<0R(AN0(ISQ(* C PV DLOS I NGULAR) ( I SQ H SINGULAR)) 

2.8.2 (ANDtlSQ PT PLURALXISQ H PLURAL))) 

2.8.3 NIL (AGREEMENT)) 

This Is an example of a branch statement with an error 
message. It moves the pointer from the currently active node 
(the VP) to the previous node (the NP) and down to Its last 
contltuent (the noun). It then checks to see whether this 
shares the feature SINGULAR with the last constituent parsed by 
VP (the verb). If not It checks to see whether they share the 
feature PLURAL. Notice that once PT has been set by *, It 
remains at that position. If agreement Is found, evaluation 
continues as before with line 2.9. If not, the program VP falls 
with the message (AGREEMENT). 

So far we have not made much use of features, except on 
words. As the grammar gets more complex, they become much more 
Important. As a simple example, we may wish to augment our 
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grammar to accept the noun groups "these fish," "this fish," 
"the giraffes," and "the giraffe," but not "these giraffe," or 
"this giraffes." We can no longer check a single word for 
agreement, since "fish" gives no clue to number In the first 
two, while "the" gives no clue In the third and fourth. Number 
Is a feature of the entire noun group, and we must Interpret It 
In some cases from the form of the noun, and In others from the 
form of the determiner. 

We can rewrite our programs to handle this complexity as 
shown In Grammar 3; 



3.1 (PDEFINE SENTENCE 

3.2 (((PARSE NP)NIL FAIL) 

3.3 ((PARSE VP) FAIL FAIL RETURN))) 



3.d (PDEFINE NP 

3.5 (((AND(PARSE DETERMINER) ( FQ DETERMINED) )NI L NIL FAIL) 

3.6 ((PARSE NOUN)NIL FAIL) 

3.7 ((CQ DETERMINED)DET NIL) 

3.8 ((AND(* HHTRNSF (QUOTE ( S I NGULAR PLURAL) )) )RETURN FAIL) 

3.9 DET 

3.10 ((TRNSF (MEET(FE(- H PV (DETERMINER))) 

3.11 (QUOTEtSINGULAR PLURAL)))) 

3.12 RETURN 

3.13 FAIL))) 

3.11* (PDEFINE VP 

3.15 (((PARSE VERBJNIL FAIL) 

3.16 ((MGETtFE H)(FE(« C PV ( NP) ) ) (QUOTEtS I NGULAR PLURAL))) 

3.17 NIL 

3.18 (AGREEMENT)) 

3.19 (OSQ H TRANSITIVEJNIL INTRANS) 

3.20 ((PARSE NP)RETURN NIL) 

3.21 < C ISQ H IHTRANSITIVE)RETLIRN FAIL))) 

Figure 2U -- Grammar 3 
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We have used the PROGRAMMAR functions FQ and TRHSF, which 
attach features to constituents. The effect of evaluating (FQ 
A) Is to add the feature A to the list of features for the 
currently active node of the parsing tree. TRHSF Is used to 
transfer features from the pointer to the currently active node. 
Its argument is a list of features to be looked for. For 
example, line 3.8 looks for the features SINGULAR and PLURAL In 
the last constituent parsed (the NOUN)/ and adds whichever ones 
It finds to the currently active node. The branch statement 
beginning with line 3.10 Is more complex. The function * finds 
the DETERMINER of the HP being parsed. The function FE finds 
the list of features of this node, and the function MEET 
Intersects this with the list of features (SINGULAR PLURAL), 
This Intersection Is then the set of allowable features to be 
transferred to the NP node from the NOUN. Therefore If there Is 
no agreement beween the NOUN and the DETERMINER, TRNSF falls to 
find any features to transfer, and the resulting failure causes 
the rejection of such phrases as "these giraffe." 

In line 3.7 we use the function CQ which checks for 
features on the current node. ( CQ DETERMINED) will be non-NIL 
only If the current node has the feature DETERMINED. (I.e. It 
was put there In line 3.5) Therefore, a noun group with a 
determiner Is marked with the feature DETERMINED, and Is also 
Eiven features corresponding to the Intersection of the number 
features associated with the determiner If there Is one, and the 
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noun. Notice that this grammar can accept noun groups without 
determiners, as in "Giraffes eat apples." since line 3.5 falls 
only If a DETERMINER Is found and there are no more words in the 
sentence. 

In conjunction with the change to the MP program, the VP 
program must be modified to check with the HP for agreement. 
The branch statement beginning on Line 5.16 does this by making 
sure there Is a number feature common to both the subject and 
the verb. 

This brief description explains some of the basic features 
of PROGRAMMAR, In a simple grammar, their Importance Is not 
obvious, and Indeed there seem to be easier ways to achieve Che 
same effect. As grammars become more complex, the special 
aspects of PROGRAMMAR become more and more Important. The 
flexibility of writing a grammar as a program Is needed both to 
handle the complexities of English syntax, and to combine the 
semantic analysis of language with the syntactic analysis In an 
Intimate way. Section 2. J describes a fairly complex grammar of 
English, and section h.2 describes the way It Is Integrated with 
the semantic programs. A number of the other features and 
details of PROGRAMMAR are described In Section 2.b. 
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2.2.7 Ambiguity and (Jnverstand I n& 

Readers familiar with parsing systems may by now have 
wondered about the problem of ambiguity. As explained, a 
PROGRAMMAR program tries to find a possible parsing for a 
sentence, and as soon as It succeeds. It returns Its answer. 
This Is not a defect of the system, but an active part of the 
concept of language for which It was designed. The language 
process Is not segmented Into the operation of a parser, 
followed by the operation of a semantic Interpreter. Rather, 
the process Is unified, with the results of semantic 
Interpretation being used to guide the parsing This Is very 
difficult In other forms of grammar, with their restricted types 
of context-dependence. But It Is straightforward to Implement 
In PROGRAMMAR. For example, the last statement In a program for 
NP may be a call to a noun-phrase semantic Interpreter. If It 
Is Impossible to Interpret the phrase as It Is found, the 
parsing Is Immediately redirected. 

The way of treating ambiguity Is not through listing all 
12i»5 possible Interpretations of a sentence, but In being 
Intelligent In looking for the first one, and being even more 
intelligent In looking for the next one If that falls. There Is 
no automatic backup mechanism In PROGRAMMAR, because blind 
automatic backup Is tremendously Inefflcent. A good PROGRAMMAR 
program will check Itself when a failure occurs, and based on 
the structures It has seen and the reasons for the failure. It 
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will decide specifically what should be tried next. This Is the 
reason for Internal fal lure-messseges, and there are facilities 
for performing the specific backup steps necessary. (See 
section 2.U.5) 

As a concrete example, we might have the sentence "I rode 
down the street in a car. 1 ' At a certain point In the parsing 
the NP program may come up with the constituent "the street In a 
car". Before going on, the semantic analyzer will reject the 
phrase "In a car" as a possible modifier of "street", and the 
program will attach It Instead as a modifier of the action 
represented by the sentence. Since the semantic system Is a 
Part of a complete deductive unders tander, with a definite 
world-model, the semantic evaluation which guides parsing can 
include both general knowledge (cars don't contain streets) and 
specific knowledge (Melvln owns a red car, for example). Humans 
take advantage of this sort of knowledge In their understanding 
of language, and It has been pointed out by a number of 
linguists and computer scientists that good computer handling of 

language will not be possible unless computers can do so as 
well. 

Few sentences seem ambiguous to humans when first read. 
They are guided by an understanding of what Is said to pick a 
single parsing and a very few different meanings. By using this 
same knowledge to guide Its parsing, a computer understanding 
system can take advantage of the same technique to parse 
meaningful sentences quickly and efficiently. 

We must be careful to distinguish between grammatical and 

semantic ambiguity. Although we want to choose a single parsing 
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without considering the alternatives simultaneously, we want to 
handle semantic ambiguity very differently. There may be 
several Interpretations of a sentence which are all more or less 
meaningful, and the choice between them will depend on a complex 
evaluation of our knowledge of the world, of the knowledge the 
person speaking has of the world, and of what has been said 
recently. 
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In understanding the reason for developing PROGRAMMAR, 
several factors are Important. The first Is that only through 
the flexibility of expressing a grammar as a program can we 
Introduce the type of Intelligence necessary for complete 
language understanding. PROBRAMMAR Is able to take Into account 
the fact that language Is structured In order to convey meaning, 
and that our parsing of sentences depends Intimately on our 
understanding that meaning. PROGRAMMAR can take advantage of 
this to deal more efficiently with natural language than a 
general rule-based system, whether context-free or 
transformational. More Important, the analysis returned by 
PROGRAMMAR Is designed to serve as a part of a total 
understanding process, and to lend Itself directly to semantic 
Interpretation. This was one reason for selecting systemic 
grammar, and has guided much of the design of the system. The 
exact way In which semantic Interpretation can be done, and the 
reasons why a systemic analysis Is Important will be discussed 
In sections b.l and U t 2 t 
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2,5 A Grammar of English 

2-5-1 About ih& Grammar 

This section describes the grarwar of English used by our 

system. It Is based on the principles of systemic uagBftl! 
<<Halllday 1961, 1966a, 1966b, 1967>), and emphasizes the 
analysis of the significant features exhibited by llngustlc 
structures, rather than their detailed form. Instead of giving 
a set of grammar rules or algorithms, this section will try to 
convey an Impressionistic overview of English grammar, giving 
examples of the most Important features. Section 2.3.11 follows 
the parser through two sample sentences, showing how the grammar 
Is used. The actual PROGRAMMAR gramrrar contains the details, 
and Is available on request. A few particularly Interesting 
parts of the detailed grammar are described In section 2.1*. 
Appendix A Is a glossary of features with references to the 
pages on which they are Illustrated, 

Before beginning, several -warnings are In order. First, 
this Is not by any means a complete grammar of English. The 
task of codifying an entire language In any formalism Is so 
large that It would be folly to try In the course of a single 
research project. Our goal was to cover a large enough portion 
of English syntax so that a user could converse comfortably with 
the system about Its toy-block world. There are whole areas of 
syntax which are Involved with conveying Information of types 
not Included In this narrow field (such as the emotional 
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reaction, mood, and emphasis of the speaker). These are not 
handled at all, and even within the toy-hlock world, there are 
numerous sentences and constructions which the grammar Is not 
yet equipped to handle. It will he of interest to see whether 
the basic structure of the syntactic theory Is flexible enough 
to add the treat amount of complexity which could be Included In 
a more complete grammar. 

Second, the grammatical theory Is used In a very Impure 
way. The main consideration was to produce a working grammar 
which could serve In a language-understanding program. The 
demands of practicality often overrode more theoretical 
criteria, and the resulting grammar Is not very "pretty". This 
Is especially true since It has evolved in a continuous process 
of writing and debugging, and has not yet undergone the 
"polishing" which removes the traces of its earlier stages of 
deve lopment . 

Demands of time made it Impossible to coordinate the 
writing of the grammar with other current versions of systemic 
grammar, so the analysis Is non-standard, often disagreeing with 
HallJday's analysis or other more complete versions. Some 
differences are simply notatlonal (using different names for the 
same thing), others are Intentional simplifications (Halliday's 
analysis Is much more complete), and some represent actual 
theoretical differences (for example, our analysis of the 
transitivity system puts much of the structure Into the semantic 
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rather than syntactic rules, while Halllday's Is more purely 
syntactic). We will not describe the dlffprenccs In detail, 
since this Is not a proposal for a specific version of English 
grammar. It Is Instead a proposal for a way of looking at 
language, and at English, pointing out some of the Interesting 
features. 
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2.?. 2 Vnlt5, Rank, £nd Features 

We will begin by descrlblnfi some of the basic concepts of 
systemic grammar, before giving details of their use In our 
analysis. Some of the description Is a repetition of material 
In Section 2.1, In that section we needed to give enough 
explanation of systemic grammar to explain PROGRAMMAR. Here we 
give a more thorough explanation of Its details. 

The first Is the notion of syntactic Uflili In analyzlne the 
constituent structure of a sentence (the way It Is built up out 
of smaller parts) * If we look at other forms of grammar, we see 
that syntactic structures are usually represented as a binary 
tree, with many levels of branching and few branches at any 
node. The tree Is not organized 1 Into "groupings" of phrases 
which are used for conveying different parts of the meaning. 
For example, the sentence "The three big red dogs ate a raw 
steak.' 1 would be parsed with something like the first tree In 
Figure 25. 

Systemic grammar pays more attention to they way language 
Is organized Into units, each of which has a special role In 
conveying meaning. In English we can distinguish three basic 
ranks of units, the CLAUSE, the GROUP, and the WORD. There are 
several types of groups: NOUN GROUP <NG), VERB GROUP (VG) 
PREPOSITION GROUP (PREPG) and ADJECTIVE GROUP (ADJG). In a 
systemic grammar, the same sentence might be viewed as having 
the second structure In Figure 25. 



Section 5.1.2 - Page 96 




I 



three ADJ 
big 



/ 


\ 


ate DET 

1 

HP? a 


ADJ NP1 


l\ 


1 J, 


ADJ NP2 


raw NOUN 


| 1 


1 


red NOUN 


steak 



dOR5 



Tree 1 



,-NG., 

DET NUM ADJ ADJ NOUN 

I I I I I 

the three big red dop.s 



CLAUSE 
I 

VG 

I 

VB 

I 
ate 



sr* 


v. 


DET ADJ 


NOUN 


1 1 


| 


a raw 


steak 



Tree 2 



Figure 25 - Parsing Trees 
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In this analysis, the WORD Is the basic building block. 
There are word classes like "adjective" , "noun", "verb", and 
each word Is an Integral unit -- It Is not chopped Into 
hypothetical bits (like analyzing "dogs" as being composed of 
"dog" and "-s" or "dog" and "plural"). Instead we view each 
word as exhibiting features. The word "dogs" Is the same basic 
vocabulary Item as "dog", but has the feature "plural" Instead 
of "singular". The words "took", "take", "taken", "taking", 
etc., are all the same basic word, but with differing features 
such as "past participle" (EN), "Infinitive" (INF), "-Ing" 
(ING), etc. When discussing features, we will use several 
notatlonal conventions. Any word appearing In all upper-case 
letters. Is the actual symbol used to represent a feature In our 
grammar and semantic programs. A feature name enclosed In 
quotes Is an English version which Is more Informative. Usually 
the program version is an abbreviation of the English version, 
and sometimes we will indicate this by typing the letters of the 
abbreviation in upper-case, and the rest In lower-case. Thus |f 
"determiner" Is abbreviated as DET, we may write DETermlner. We 
may even write things like QuaNTIFIeR. When we want to be more 
careful, we will write "quantifier" (0.NTFR). 

The next larger unit than the WORD Is the GROUP, of which 
there are the four types mentioned above. Each one has a 
particular function In conveying meaning. Noun groups (NG) 
describe objects, verb groups (VG) carry complex messages about 
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the time and modal (logical) status of an event or relationship, 
preposition groups (PREPG) describe simple relationships, while 
adjective groups (ADJG) convey other kinds of relationships and 
descriptions of objects. These semantic functions are described 
In more detail In section k.2. 

Each GROUP can have "slots" for the words of which it Is 
composed. For example, a NG has slots for "determiner" (DET), 
"numbers" (NUM.), "adjectives" (ADJ), "classifiers" (CLASF), and 
a NOUN. Each group can also exhibit features, just as a word 
can. A NG can be "singular" <NS) or "plural" (NPL), "definite" 
(DEF) as In "the three dogs" or "Indefinite" (INOEF) as In "a 
steak", and so forth. A VG can be "negative" <NEG) or not, can 
be MODAL (as In "could have seen"), and It has a tense. (See 
Section 2.3.8 for an analysis of complicated tenses, such as "He 
would have been going to be fixing It.") 

Finally, the top rank Is the CLAUSE. We speak of clauses 
rather than sentences since the sentence Is more a unit of 
discourse and semantics than a separate syntactic structure. It 
is either a single clause or a series of clauses joined together 
In a simple structure such as "A and B and...". We study these 
conjoining structures separately since they occur at all ranks, 
and there Is no real need to have a separate syntactic unit for 

sentence. 

The clause Is the most complex and diverse unit of the 
language, and Is used to express complex relationships and 
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events. Involving time, place, manner and many other aspects of 
meaning. It can be a QUESTION, a DECLARATIVE, or an IMPERATIVE, 
It can be "passive" (PASV) or "active" (ACTV), It can be a YES- 
NO question or a WH- question (like "Why...?" or "Which...?"). 

Looking at our sample parsing tree. Tree 2 In Figure 25, we 
see that the clauses are made up of groups, which are In turn 
made up of words However few sentences have this simple three- 
layer structure. Groups often contain other groups (for 
example, "the call of the wild" Is a NG, which contains the 
PREPG "of the wild" which In turn contains the KG "the wild"). 
Clauses can be parts of other clauses (as In "Join the Navy 1©. 
£££ lh£. world,."), and can be used as parts of groups In many 
different ways (for example. In the NG "the man who came ^ 
dinnex" or the PREPG "by leaving the country ".) This phenomenon 
Is called ranKsh'ft, and Is one of the haslc principles of 
systemic grammar. 

If the units can appear anywhere In the tree, what Is the 
advantage of grouping constituents Into "units" Instead of 
having a detailed structure like the one shown In our first 
parsing tree? The answer Is In the "features" we were noting 
above. Each unit has associated with It a set of features, 
which are of primary significance In conveying meaning. We 
mentioned that a clause could have features such as IMPERATIVE 
DECLARATIVE, QUESTION, ACTV, PASV, YES-NO, and WH- . These are 
not unrelated observations we can make about a clause. They are 
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related by a definite logical structure. The choice between 
YES-NO and WH- Is meaningless unless the clause Is a QUESTION, 
but if It Is a QUESTION, the choice must be made. Similarly, 
the choice between QUESTION, IMPERATIVE, and DECLARATIVE Is 
mandatory for a MAJOR clause (one which could stand alone as a 
sentence), but Is not possible for a "secondary" (SEC) clause, 
such as "the country which possesses Hi£ b^unto." The choice 
between PASV (as In "the ball was attended by John",) and ACTV 
{as In "John attended the ball.") Is on a totally different 
dimension, since It can be made regardless of which of these 
other features are present. 

We can represent these logical relationships graphically 
using a few simple conventions. A set of mutually exclusive 
features (such as QUESTION, DECLARATIVE, and IMPERATIVE) Is 
called a system , and Is represented by connecting the features 
wl th a vertical bar: 

QUESTION 

DECLARATIVE 

IMPERATIVE 
The vertical order Is not Important, since a system Is a 
set of unordered features among which we will choose one. Each 
system has an ontrv condition which must be satisfied In order 
for the choice to be meaningful. This entry condition can be an 
arbitrary boolean condition on the presence of other features. 
The simplest case (and most common) Is the presence of a single 
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other feature. For example, the system just depicted has the 
feature MAJOR as Its entry condition, since only MAJOR clauses 
make the choice between DECLARATIVE, IMPERATIVE, and QUESTION. 

This simple entry condition Is represented by a horizontal 
line, with the condition on the left of the system being 
entered. We can diagram some of our CLAUSE features as: 

DECLARATIVE 

MAJOR IMPERATIVE 

SEC QUESTION 



CLAU 



H 



IYES-N0 
IWH- 



Often there are Independent systems of choices sharing the 
same entry condition. For example, the choice between SEC and 
MAJOR and the choice between PASV and ACTV both depend directly 
on the presence of CLAUSE. This type of relationship will be 
Indicated by a bracket In place of a vertical bar. 

| MAJOR . .. 

JSEC 
PASV 



CLAUSE 



ACTV 

If we want to assign a name to a system (to talk about It), 
we can put the name above the line leading Into It: 

VOICE IPASV 
I ACTV 
We can look at these notations as representing the logical 
operations of "or" and "and", and! we can use them to represent 
more complex entry conditions. If the choice between the 
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features C and D depends on the presence of either A o_£ B, we 

draw; 



A — 
B — 



and If the entry condition for the "C-D" system Is the 
presence of both A and B, we write: 




Finally, we can allow "unmarked" features, In cases where 
the choice Is between the presence or absence of something of 
Interest. We might have a system like; 
NEGATIVITY (NEGATIVE 



In which the feature "non-negative" Is not given a name, but Is 
assumed unless the feature NEGATIVE Is present. 

He will explain our grammar by presenting the system 
networks for all three ranks -- CLAUSE, GROUP, and WORD, and 
giving examples of sentences exhibiting the features. We have 
not attempted to show all of the logical relationships In the 
networks — our networks may Indicate combinations of features 
which are actually not possible, and would need a more complex 
network to represent properly. We have chosen clarity over 
completeness whenever there was a conflict. In addition, we 
have represented "features" of units (I.e. descriptions of their 
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structure) and "functions 11 (descriptions of their use) In the 
same network. In a more theoretical presentation. It would be 
preferable to distinguish the two. The names chosen for 
features were arbitrary mnemonics Invented as they were needed, 
and are neither as clear nor as systematic as they might be In a 
"cleaned up" version. 
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2-3^5 The £Uii££ 

The structure exhibiting the greatest variety In English I* 
the CLAUSE. It can express relationships and events Involving 
time, place, manner, and other modifiers. Its structure 
Indicates what parts of the sentence the speaker wants to 
emphasize, and can express various kinds of focus of attention 
and emotion. It determines the purpose of an utterance — 
whether It Is a question, command, or statement -- and Is the 
basic unit which can stand alone. Other units can occur by 
themselves when their purpose Is understood, as In answer to a 
question, but the clause Is the primary unit of discourse. 

The CLAUSE has several main Ingredients and a number of 
optional ones. Except for special types of Incomplete clauses, 
there Is always a verb group, containing Che verb, which 
Indicates the basic event or relationship being expressed by the 
CLAUSE. Almost every CLAUSE contains a subject, except for 
IMPERATIVE (In which the semantic subject Is understood to be 
the person being addressed), and embedded clauses In which the 
subject lies somewhere else In the syntactic structure. In 
addition to the subject, a CLAUSE may have various kinds of 
objects, which will be explained in detail later. It can take 
many types of modifiers (CLAUSES, GROUPS, and WORDS) which 
Indicate time, place, manner, causality, and a variety of other 
aspects of meaning. One part of the CLAUSE system network Is 
shown in Figure 26. 
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Flgure 26 -- NETWORK 1 
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Beginning at the top of the network, we see a choice 
between MAJOR (a clause which could stand alone as a sentence) 
and "secondary" (SEC). A MAJOR clause Is either an IMPERATIVE 
(a cofrtnand), a DECLARATIVE, or a QUESTION, Questions are either 
YES-NO — answerable by "yes" or "no", as Ini 

(si) Did you like the show? 
or WH- (Involving a question element like "when", "where", 
"which", "how", etc.). The choice of the WH- feature leads Into 
a whole network of further choices, which are shared by QUESTION 
and two kinds of SECondary clauses we will discuss later. In 
order to share the network, we have used a simple notatlonal 
trick -- the symbols contain a "•", and when they are being 
applied to a question, we replace the • with "Q", while when 
they are applied to relative clauses, we use "REL", For 
example, the feature "PREP*" In the network will be referred to 
as PREPQ when we find It In a question, but PREPREL when It \t 
In a relative clause. This Is due to the way the grammar 
evolved, and In later versions we will probably use only one 
name for these features. This complex of features Is basically 
the choice of what element of the sentence Is being questioned. 
English allows us to use almost any part of a clause as a 
request for Information. For example. In a PREPQ, a 
prepositional group In the clause Is used, as In: 

(s2) With what did you erase It? 
We more comnonly find the preposition In a DANGLING position, as 
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In: 

(s3) What did you erase It wi th ? 
We can tell by tracing back through Network 1 that sentence s3 
has the features PREPQ, DANGLING. WH-, QUESTION, and MAJOR. 

We can use a special question adverb to ask questions of 
time, place, and manner, as In: 

(sU) W h y did the chicken cross the road? 

(s5) When were you born? 

(s6) How will you tell her the news? 

(s7) Where has my little dog gone? 

These are all marked by the feature ADJQ. In discourse they can 

also appear In a short form (SHORT) In which the entire 

utterance Is a single word, as In: 

(s8) UhY? 
We can use the word "how" In connection with a measure adverb 
(like M fast n ) to ask an ADVMEASQ, like; 

(s9) How fast can he run the ml le? 

The most flexible type of WH- question uses an entire noun 
group as the question element, using a special pronoun (like 
"what" or "who 1 ') or a determiner (like "which", or "how many") 
to Indicate that It Is the question element. These clauses have 
the feature NGQ, and they can be further divided according to 
the function of the NG In the clause. It can have any of the 
possible NG functions (these w|M be described more formally 
with regard to the next network). For example. It can be the 
subject, giving a SUBJQ, like: 

(slO) Wftl gh hand holds the M and M ! s? 
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It can be the subject of a THERE clause {see below), giving us 
SUBJTQ: 

(sll) How tnanv Puerto Rlcans are there In Boston? 

A complement Is the second half of an "Is" clause, like: 

(sJ2) Her hair Is Xfid.. 
and It can be used to form a COHPQ: 
(sl3) What color was her hair? 
or with a "measure" In a MEASQ: 
(sib) How deeo Is the ocean? 

The noun group can be an object, leading to the feature 

OBJQ, as In: 

(sl5) What do you want? or 

(sl6J WJio. did you give the book? 

These are both 0BJ1Q, since the first has only one object 

("what"), and the second questions the first, rather than the 

second object ("who". Instead of "the book"). We use the 

ordering of the DECLARATIVE form "You gave me. ih£. book". If 

this were reversed, we would have an 0BJ2Q, like: 

(sl7) What did you give him? 

If we use the word "to" to express the first object with a 

two object verb like "give", we can get a TRANST02Q, like: 

(sl8) To tfh pfn did you give the book? or 

(sl9) WJ12 did you give the book to? 

Sometimes a NG can be used to Indicate the time In a 

clause, giving us a TIMED,: 

(s20) What ga,v will the Iceman come? 
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In a more complex style, we can embed the Question element 
within an embedded clause, such as: 

(s21) Whic h car did your brother say that 

he was expecting us to tell Jane to buy? 

The NG "which car 11 Is the question element, but Is In fact 
the object of the clause "Jane to buy...", which Is embedded 
several layers deep. This kind of NGQ Is called DOWNQ. The 
role of the question element In the embedded clause can Include 
any of those which we have been describing. For example It 
could be the object of a preposition, as In 

(s22) h'hflt state did you say Lincoln was born in? 

Looking at the network for the features of SECondary 

clauses, we see three main types — AOJUNCT, "Rank-Shifted 

Qualifier" (RSQ), and "Rank-Shifted to act as a Noun Group" 

(RSNG). ADJUNCT clauses are used as modifiers to other 

clauses, glvlns time references, causal relationships, and other 

similar Information, We can use a BOUND clause contal nlng a 

"binder" such as "before", "while", "because", "If", "so", 

"unless", etc., as In; 

(s23) While Nero fiddled . Pome burned. 

(s2U) If It rains , stay home. 

(s25) Is the sky blue because I t Is cold ? 

To express manner and purpose, we use a TO clause or an ING 
clause: 

(s26) He died ifl save ^ from £U£ sins . 

(s27) The bridge was built using orlnl t Ive tools . 

The RSQ clause is a constituent of a NG, following the noun 
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In the "qualifier" position (see Section 2.3.5 for a description 

of the positions In a NG) . It Is one of the most commonly used 

secondary clauses, and can be of four different types. Three of 

them are classified by the form of the verb croup within the 

clause — TO, ING, and EN (where we use "en" to represent a past 

participle, such as "broken"): 

(s28) the man i£ see about fl Job 
($29) the piece holding £he door £n 
Cs30) a face weathered by sun and wind 

Notice that the noun being modified can have various roles 

In the clause- In examples 28 and 29, "piece" Is the subject of 

"hold", while "man" Is the object of "see". We could have said: 

(s31) the man l£ ifi the Job 

In which "man" Is the subject of "do". Our semantic analysis 

sorts out these possibilities In determining the meaning of a 

secondary clause. 

The fourth type of RSQ clause Is related to WH- questions, 

and Is called a WHRS. It uses a wh- element like "which" or 

"what", or a word like "that" to relate the clause to the noun 

It Is modifying. The different ways it can use this "relating" 

element are very similar to the different possibilities for a 

Question element In a WH- question, and In fact the two share 

part of the network. Here wc use the letters REL to Indicate we 

are talking about a relative clause, so the feature PREP* In 

Network 1 becomes PREPREL. In sentences (s2) through (s22), we 

Illustrated the different types of WH- questions. We can show 
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parallel sentences for WHRS RSQ clauses. The following list 

shows some examples and the relevant feature names: 

(s32> the thing with which j£ojj erased J_t. PREPREL 

<s33) che thing that *flu. erased J_t with PREPREL DANGLING 

JS3JO the reason x&x lh& chicken crossed the road RELADJ 

(s3S) the day when vou were born RELADJ 

(s36) the way v*fi wll 1 tell h£L lh& news RELADJ 

(s37) the place my. 1 It tie d^g. ha_s_ gone RELADJ 

Cs38J the reason why RELADJ SHORTREL 

! S ?jjJ th e hand which rocks £&& cradle SUBJREL 

IsiiQ) the number of Puerto Rlcans there are In Boston 

SUBJTREL 

(shl) the color h&l. h£±L ttii last week COMPREL 

(si*2) the depth the ocean wl 1 1 be MEASREL 

(sii3) the information thgt. yo u want 0BJ1REL 

(sUi*) the man you gave , the book 0BJ1REL 

(sli5) the book which yo u gave hip :- 0BJ2REL 

(si*6) the man to whom you gave the book, TRANST02REL 

(s<*7) the man vou gave the bo ok , to TRANST02REL 

tsii8) the day Xh£ Iceman came TIMEREL 

(s£i9) the car ioyj; brother said fi£ j^ expecting ±r5, £& 

IE 1 1. Jane £q buy DOWNREL 

<s50> the state J£ojj said Lincoln w^s. born In DOWNREL 

Notice that In sentences 36, 37, iiO, 1.1, 1.2, kk, 1.7, 1.8, 
1.9, and 50, there Is no relative word like "which" or "that". 
These could just as well all have been put In, but English gives 
us the option of omitting them. When they are absent, the 
CLAUSE Is marked with the feature RELOEL. 

Returning to our network, we see that there Is one other 
type of basic clause, the RSNG. This Is a clause which Is rank- 
shifted to serve as a NG. It can function as a part of another 
clause, a preposition group, or an adjective group. There are 
four basic types. The first two are TO and ING, as In: 

(s51) I like ifl II*. TO 

(s52) Bui 1 d t n R houses Is hard work. ING 

(s53) He got It by sajjjg, coupons. |NG 
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Notice that In s51, the RSNG clause Is the object (0BJ1), In 

s5Z It Is the subject <SUBJ>, and In s53 It Is the object of a 

preposition CPREPOBJ). We can have a separate subject within 

the TO and ING clauses, giving us the features SUBTO and SUB1NG: 

(s5lt) I wanted Ruth lo lead lifi revolution. SUBTO 

<s55> They liked JoJi^s ifiadlii£ J_t. SUBING 

The SUBING form takes Its subject In the possessive. 

In addition to ING and TO, we have the REPORT CLAUSE, which 

has the structure of an entire sentence, and Is used as a 

participant In a relation about things like hearing, knowing, 

and saying: 

(s56) She heard that £hj| other ififlfl) iifld won^. 
(s57> That sjifi wasn 't there surprised us. 
(s58) I knew h£. could do. 1 1. 

The word "that" Is used In s56 and sS7 to mark the beginning 

of the REPORT CLAUSE, so they are assigned the feature THAT. 

The absence of "that" Is left unmarked. 

If the subject of a clause Is In turn a R5NG clause, we may 

have trouble understanding It: 

(s59) That anyone wJifi knew £h& cgmfalngUop. C . PUl d h&XS. 
opened tJlC lock was obvious. 
There Is a special mechanism for rearranging the sentence by 
using the word "It", so that the complicated subject comes last: 
(s60) It was obvious that anyone w£fi JmfiW. Hifi comb | nation 
could have opened the IflcK^ 
In this case, we say that the RSNG clause Is serving as an 
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ITSUBJ. TO and I NO clauses can do the same: 

<s61) Itwlll be fun _l£ see them aea In. 

(s62) It was dangerous going us wj thout; a. parachute. 

The final type of RSNG Is Che WHRS, which Is almost 

Identical to the WHRS RSQ described above. Rather than go 

through the details again, we will Indicate how a few of our RSQ 

examples (sentences s32 to s50> can be converted, and will leave 

the reader to do the rest. 

(s63) I don't know jM iUS ilLd J_t with. PREPREL DANGLING 

ts6&) Ask him when h£ tf^s born. RELAOJ 

Cs65) He told me why. RELADJ SHORTREL 

ts66) It Is amazing box many Puerto Rlcans there are Jji 

Poston. SUBJTREL 

tsb/J Only her hairdresser knows what color her hair was. 

COMPRFL 
etc. 

Let us examine one case more carefully: 

(s68) I knew which £££ YOUr. brother said thgj ^ 
wis. expecting y^ j£ t-el 1 Jane i£ huv. 

Here we have a DOWNREL clause, "which car buy", serving 

as the object of the CLAUSE "I knew...". However, this means 
that somewhere below, there must be another clause with a slot 
Into which the relative element can fit. In this case. It Is 
the RSNG TO clause "Jane to buy", which Is missing Its ohject. 
This clause then has the feature UPREL, which Indicates that Its 
missing constituent Is somewhere above In the structure. More 
specifically It Is 0BJ1UPREL. 

Once this connection Is found, the program might change the 
pointers In the structure to place the retatlve as the actual 
0BJ1 of the embedded clause structure. In the current grammar. 
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Che pointers are left untouched, and special commands to the 
moving function * are used when the object Is referenced by the 
semantic program. 
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2.3.1* Transitivity 1q ^^ Clause 

In addition to the systems we have already described, there 
Is a TRANSITIVITY system for the CLAUSE, which describes the 
number and nature of Its basic constituents. We mentioned 
earlier that a CLAUSE had such components as a subject and 
various objects. The transitivity system specifies these 
exactly. We have adopted a very surface-oriented notion of 
transitivity. In which we note the number and basic nature of 
the objects, but do not deal with their semantic roles, such as 
"ranee" or "beneficiary". Halllday's analysis <HalIIday 1967> 
Is somewhat different, as It Includes aspects which we prefer to 
handle as part of the semantic analysis. Our simplified network 
Is: 
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Figure 27 --NETWORK 2 



Ilk.H 



The first basic division Is Into clauses with the main verb 

be", and those with other verbs. This Is done since RE clauses 

have very different possibilities for conveying meaning, and 

they do not have the full range of syntactic choices open to 

other clauses. BE clauses are divided Into two types -- THERE 

clauses, like: 

(s69) There was an old woman who I Ived In ■ shoe. 

and INTensIve BE clauses: 

(s70) War la hell. 
A THERE CLAUSE has only a subject, marked SUBJT, while an INT 
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CLAUSE has a SUBJect and a COMPlement. The COMPlement can be 
either a NG, as In s70 or: 

(s71) He was in agent Qf £h£ FBI . 
or a PREPG: 

(572) The klne was J_q xh& counting house. 

or an ADJG: 

<s73) Her strength was fantas t Ic. 
<s7ft) My daddy Is stronger tjun yours. 

Other clauses are divided according to the number and type 

of objects they have. A CLAUSE with no objects Is Intransitive 

(ITRNS): 

(s75) He Is running. 

With one object It Is transitive (TRANS): 

(s76) He runs a ml I 1 Iqcj machine. 

With two objects TRANS2: 

(s77) I gave nii love fl cherry, 

Some verbs are of a special type which use a location as a 
second object. One example Is "put 11 , as In: 
(s78) Put the block on Xb& table. 

Note that this cannot be considered a TRANS with a 
modlf ler / as In: 

(s79) He runs a milling machine in Chicago. 

since the verb "put 11 demands that the location be given. We 

cannot say "Put the block." This type of CLAUSE Is called 

TRANSL, and the location object Is the LOBJ. The LOBJ can be a 

PREPG as In 578, or a special adverb, such as "there" or 
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"somewhere", as In: 

(s80) W here dld V° u P ut lt? or 

(s81) Put It there. 

Some Intransitive verbs also need a locatlonal object for 
certain meanings, such as: 

(s82) The block Is sitting fiC lt£ table. 
This Is called ITRNSL. 

Finally, there are INTensIve clauses which are not BE 
clauses, but which have a COMPlement, as In: 

<s83) He felt sick. and 

C s&U ) He made me SJC k - 

We have not run into these with our simple subject matter, and a 

further analysis will be needed to handle them properly. 

Any of the constituents we have been mentioning can be 

modified or deleted when these features Interact with the 

features described In Network 1. For example In: 

<sE5) the block which I told vou ifl BSil fin the table 

the underlined CLAUSE Is TRANSL, but Its 0BJ1 Is missing since 

it Is an UPREL. 

English has a way of making up new words by combining a 
verb and a "particle" (PRT), producing a combination like "pick 
up", "turn on", "set off", or "drop out". These do not simply 
combine the meanings of the verb and particle, but there Is a 
special meaning attached to the pair, which may be very 
different from either word In Isolation. Our dictionary 
contains a table of such pairs, and the grammar programs use 
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them, A CLAUSE whose verb Is a part of PRT pair has the feature 
PRT. The particle can appear either Immediately after the word: 

(s8B) He .pji. ro ft awav the plan. 

or In a displaced posl t Ion (marked by the feature DPRT) r 

<s87) He threw the plans away. 

Regardless of whether there Is a PRT or not, we have the 
choice between the features passive (PASV) and active (ACTV}. 
ACTV places the semantic subject first: 

(588) liie President started the war. 

while PASV puts the semantic object first: 

(s89) The war was started by the President. 

If there Is a PREPG beginning with "by". It Is Interpreted as 
the semantic subject (as In sB9), and the CLAUSE has the feature 
AGENT. 

If the CLAUSE Is active and Its subejct Is a RSNG CLAUSE, 
we can use the IT form described earlier. This Is marked by the 
feature IT, and Its subject Is marked ITSUBJ, as In sentences 
60, 01, and 62. 
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2.3,5 UflUn Groups 

The best way to explain the syntax of the NOUN GROUP Is to 
look at the "slot and filler" analysis, which describes the 
different components It can have. Some types of NG, such as 
those with pronouns and proper nouns, will not have this same 
construction, and they will be explained separately later. 

We will diagram the tvpclal NG structure, using a "•" to 
Indicate that the same element can occur more than once. Most 
of these "slots" are optional, and may or may not be filled In 
any particular NG. The meanings of the different symbols are 
explained below. 



i — r 

DET ORD 



ADO- 



"T 

NOUN 



NUM ADJ' CLASF- NOUN Q. 

Figure 28 -- NG Structure 



The most Important Ingredient Is the NOUN, which Is almost 

always present (If It Isn't, the NG Is INCOHplete). It gives 

the basic Information about the object or objects belns referred 

to by the NG. Immediately preceding the NOUN, there are an 

arbitrary number of "classifiers" (CLASF). Examples of CLASF 

are: 

(s90) Plant 1 Ife 

(S91) water meter cover adjustment screw 

Notice that the same class of words can serve as CLASF and NOUN 

-- In fact Halllday uses one word class (called NOUN), and 
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distinguishes between the functions of "head" and "classifier". 
We have separated the two because our dictionary gives the 
meaning of words according to their word class, and nouns often 
have a special meaning when used as a CLASF. 

Preceding the CLASFs we have adjectives (ADJ), such as "big 
beautiful soft red..." We can distinguish adjectives from 
classifiers by the fact that adjectives can be used as the 
complement of a BE CLAUSE, but classifiers cannot. We can say 
"red hair", or "horse hair", or "That hair Is red.", but we 
cannot say "That hair Is horse.", since "horse" Is a CLASF, not 
an ADJ. Adjectives can also take on the COMPARatlve and 
Superlative forms ("red, redder / and reddest"), while 
classifiers cannot ("horse, horser, and horsest"! !?) . 

Immediately following the NOUN we can have various 
qualifiers (Q), which can be a PREPG: 

(s92) the man In the rroon 

or an ADJG: 

(s93) a night darl^e r thfln doom 

or a CLAUSE RSQ: 

(s9l») the woman wtlfl conducts iil£ orchestra 
We have already discussed the many types of RSQ clauses. In 
later sections we will discuss the PREPG and ADJG types which 
can occur as qualifiers. 

Finally, the first few elements In the NG work together to 
give Its logical description — whether It refers to a single 
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object, a class of objects, a group of objects, etc. The 
determiner (DET) Is the normal start for a NG, and can be a word 
such as "a", or "that", or a possessive. It Is followed by an 
"ordinal" (ORD). There Is an Infinite sequence of number 
ordinals ("first, second, third...") and a few others such as 
"last" and "next". These can be recognized since they are the 
only words that can appear between a OET like "the" and a 
number, as In: 

(s95) the next three days 

Finally there Is a NUMber. It can either be a simple 
Integer like "one", "two", etc. or a more complex construction 
such as "at least three", or "more than a thousand". It Is 
possible for a NG to have all of Its slots filled, as In: 

DET ORD NUM ADd ADJ CLASF CLASF NOUN 
the first three old red city fire hydrants 

Q(PREPG) Q.(CLAUSE) 

without covers you can find 

It Is Is also possible to have combinations of almost any 
subset. With these basic components In mind, let us look at the 
system network for NG In Figure 29. 
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Figure 29 —NETWORK 3 



First we can look at the major types of NG. A NG made up 
of a pronoun Is called a PRONG. It can be either a QUESTIon, 
like "who" or "what", or a non-question (the unmarked case) like 
"I", "them", "It", etc. The feature TPRONG marks a NG whose 
head Is a special TPRON, like "something", "everything", 
"anything", etc. These enter Into a peculiar construction 
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containing only the head and qualifiers, and In which an 
adjective can follow the head / as In: 

(595) anything green which Is bigger than the moon 

The feature PROPNG marks er\ NG made up of proper nouns, 
such as ''John", "Oklahoma", or "The Union Of Soviet Socialist 
Republ Ics." 

These three special classes of NG do not have the structure 
described above. The PRONG Is a single PRONoun, the PROPNG Is a 
string of PROPNs, and the TPRONG has Its own special syntax. 
The rest of the NGs are the unmarked (normal) type. They could 
be classified according to exactly which constituents are 
present, but In doing so we must be aware of our basic goals In 
systemic grammar. We could note whether or not a NG contained a 
CLASF or not, but this would be of minor significance. On the 
other hand, we do note, for example, whether It has a DET, and 
what type of DET It has, since this Is of key Importance In the 
meaning of the NG and the way It relates to other units. We 
distinguish betwen those with a determiner (marked DET) and 

those without one (NDET), as In: 

(s97) Cats adore fish. NDET 

(s98) Ihfi iai adored fi fish. DET 

The DET can be DEFInlte (like "the" or "that 11 ), INDEFInlte 

(like "a" or "an"), or a quantifier (QNTFR) (like "some", 

"every", or "no"). The DEFInlte determiners can be either 

DEMon5tratIve ("this", "that", etc.) or the word "the 11 (the 

unmarked case), or a POSSessIve NG, The NG "the farmer's son" 
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has the NG "the farmer 11 as Its determiner, and has the feature 

POSES to Indicate this. 

An INDEF NG can have a number as a determiner, such as: 

(s99) five gold rings 
(slOO) ai least a dozen eggs 

In which case It has the feature NUMDET, or It can use an INDEF 

determiner, such as "a". In either case It has the choice of 

being a QUESTIon, The question form of a HUMOET Is "how many", 

while for other cases It Is "which" or "what". 

Finally, an NG can be determined by a quantifier (QNTF3). 
Although quantifiers could be subclass If led along various lines, 
we do so In the semantics rather than the syntax. The only 
classifications used syntactically are between singular and 
plural (see below), and between NEGatlve and non-negative. 

If a NG Is either NUMD or QNTFR, It can be of a special 
type marked OF, 1 Ike: 

(slOl) three q± the offices 
(sl02) all a£ your dreams 

An OF NG has a OETermlner, followed by "of", followed by a 
DEFInlte NG. 

A determined NG can also choose to be INCOHplete, leaving 

out the NOUN, as an 

(sl05) Give me three. 
(slOU) I want none. 

Notice that there Is a correspondence between the cases which 

can take the feature OF, and those which can be INCOM. We 

cannot say either "the of them" or "Give me the.". Possessive* 
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are an exception (we can say "Give me Juan's." but not "Juan's 
of them"), and are handled separately (see below). 

The middle part of Network 3 describes the different 
possible functions a NG can serve. In describing the CLAUSE, we 
described the use of an NG as a SUBJ, COMP, and OBJects of 
various types. In addition. It can serve as the object of a 
PREPG (PREPOBJ), In: 

(slOS) the rape of Ihs. lock 
If It Is the object of "of" In one of our special OF NGs, It Is 
called an OFOBJ: 

(slOB) none of vour tricks 

A NG can also be used to Indicate TIME, as In: 

(sl07) Yesterday the world ended. 

(slOS) Ih£ dai sJi£ left , all work stopped. 

Finally, a NG can be the POSSesslve determiner for another 

NG. In: 

(sl09) lh£. cook 's kettles 

the NG "the cook" has the feature POSS, Indicating that It Is 
the determiner for the NG "the cook's kettle", which has the 

feature POSES. 

When a PRONG Is used as a POSS, It must use a special 

possessive pronoun, like "my", "your", etc. We can use a POSS 

In an Incomplete NG, like 

(sllO) Show me yours. 

(sill) John 's Is covered with mud. 

There Is a special class of pronouns used In these NG's 
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(labelled DEFPOSS), such as "yours", "mine", etc. 

Continuing to the last part of Network J, we see features 
of person and number. These are used to match the noun to the 
verb {If the NG Is the subject) and the determiner, to avoid 
ungrammatlcal combinations like "these kangaroo" or "the women 
wins". In the case of a PRONG, there are special pronouns for 
first, second, and third person, singular and plural. The 
feature NFS occurs only with the first-person singular pronouns 
("I", "me", "my", "mine"), and no distinction Is made between 
other persons, since they have no effect on the parsing. All 
singular pronouns or other singular NGs are marked with the 
feature NS . The pronoun "you" Is always treated as If It were 
plural and no distinction Is made between "we", "you", "they", 
or any plural (NPL) NG as far as the grammar Is concerned. Of 
course there Is a semantic difference, which will be considered 
In later chapters. 
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2.3.6 Preposition GfPVPS 

The PREPG Is a comparatively simple structure used to 
express a relationship. It consists of a PREPosltlon followed 
by an object (PREPOBJ), which (s either a HG or a RSNG CLAUSE. 
In some cases, the preposition consists of a two or three word 
combination instead of a single word, as In: 

(sll2) next ifl the table 
Csll3) sic ifle af. the house 

The grammar Includes provision for this, and the dictionary 
lists the possible combinations and their meanings. The words 
In such a combination are marked as PREP2. The network for the 
PREPG Is In Figure 30. 
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Figure 30 -- NETWORK it 





The PREPG can serve as a constituent of a CLAUSE In several 
ways. It can be a COMPlement: 



<slH) Is It In jh_e_ frltchen? 
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a locatlonal object UOBJ): 

(sll5) Put It on the table. 
an ADJUNCT: 

(sll6) He got It by sell Ing his soul. 
or an AGENT: 

(sll7) It was bought fey xh& devl 1 . 
If the PREPG Is a constituent of a QUESTION CLAUSE, It can be 
the question element by having a QUEST NR as Its object: 

(sll8) In what cltv 
Csll9) for how many days 
(sl20) by whor 

In which case the PREPG Is also marked QUEST. A PREPREL CLAUSE 
contains a RELPREPG: 

(sl21) the place la which she works 

If the CLAUSE Is an UPQUEST or an UPREL, the PREPG can be 
the constituent which Is "missing" the piece which provides the 
upward reference. In this case It Is also marked UPREL: 

(sl22) the lady I saw you wl th 
or UPQUEST: 

(sl23) Who did you knit It for ? 
In these cases. It Is also marked SHORT to Indicate that the 
object Is not explicitly In the PREPG. It can also be short If 
It Is a PREPG In a DANGLING PREPQ or PREPREL CLAUSE: 

(sl2it) what do you keep It J_q? 

Within a NG, a PREPG serves as a qualifier (QJ: 

<sl2SJ the man in the I ron masfr 
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or as the body of an OF NG: 
(sl26) some fi£ the people 
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(sl31) fls_ quick aa a flash 
COMPARatlve: 

tsl32) This one Is bigger, 
or QUESTIon: 

(5133) Hflfl well can he take dictation? 

The network Is arranged to show that a qualifier ADJG can 
be only of the first two forms -- we cannot say "a man bigger" 
without using "than", or say "a man bl«". In the special case 
of a TPRON such as "anything" as In: 

(sl3it> anything strange 
the word "strange" Is considered an ADJ which Is a direct 
constituent of the NG, rather than an ADJG. 

The grammar does not yet account for more complex uses of 

the word "than". 
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2*U Yejib Groups 

The English verb group Is designed to convey a complex 

combination of tenses so that an event can relate several time 

references. For example, we might have: 

(sl35) By next week you wll 1 have been 1 Ivljm 
here for a month. 

This Is said to have the tense "present In past In future". Its 

basic reference Is to Che future -- "next week", but It refers 

back to the past from that time, and also Indicates that the 

event Is still going on. This type of recursive tense structure 

has been analyzed by Halllday <Halllday 1966b> and our grammar 

adopts a variant of his scheme. 

Essentially the choice Is between four tenses, PAST, 

PRESENT, FUTURE, and MODAL. Once a choice between these has 

been made, a second, third, fourth, and even fifth choice can be 

made recursively. The combination of tenses Is realized In the 

syntax by a sequence of the auxllllary verbs "be", "have", and 

"going to", along with the ING, EN, and INFInltlve forms of the 

verbs. The restrictions on the recursion are: 

1. PRESENT can occur only at the outer ends of the series 
(at first and/or final choice). 

2. Except In the final two positions, the same tense 
cannot be selected twice consecutively. 

3. Future can occur only once other than In last 
posl t Ion. 

fe. Modal can be only In final position. 
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It Is Important to distinguish between the position of a 
word In the VG and the position of Its tesnse In the recursive 
tense feature -- the direction Is reversed. In sl35, "will" Is 
the first word, and "living" the last, while the tense Is 
PRESENT In PAST In FUTURE. Some sample verb groups and their 
tenses are: (from <HallIday 196Gb>) 



ACTIVE 

took - past 

takes - present 

wl 1 1 take - future 

can take - modal 

has taken - past In present 

was taking - present In past 

was going to have taken - past In future In past 

was going to have been taking - present In past In future In past 

PASSIVE 

Is taken - present 
could have been taken - past In modal 
has been going to have been taken - 

past In future In past In present 

Figure 32 -- Verb Group Tenses 



The structure of a finite VG (one taking part In this tense 
system — see below for other types) Is a sequence of verbs and 
auxiliaries In which the last Is the "main verb" (marked MVB 
and remembered by the parser), a^id the first Is either a MODAL, 
the word "will", or a "finite" verb (one carrying tense and 
number agreement with the subject). Interspersed In the 
sequence there may be adverbs, or the word "not" (or Its reduced 
form "n't"). The best way to describe the relationship between 
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the sequence of verbs and the tense Is by giving a flow chart 
for parsing a VG, This Is a good example of the usefulness of 
representing syntax In the form of procedures, as It describes a 
relatively complex system In a clear and succinct way. 

In the flow chart (Figure 35) the variable T represents the 
tense, and the symbol "." Indicates the addition of a member to 
the front of a list. The "-" Indicates replacement In the 
FORTRAN sense, and the function "REMOVE" removes words from Che 
Input string. The features used are those described for verbs 
In section 2.3.9, The con?nand ( FQ PASV) Indicates that the 
entire VG Is to be marked with the feature PASV (passive voice). 
The flow chart does not Indicate the entire parsing, but only 
that part relevant to determining the tense. 
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Figure 33 « Syntax of VG Tense Structure 
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This system of tenses Is operative only for FINITE verb 
groups. The network for the VG In general Is: 
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Figure S>* --NETWORK 6 



There are several types of VG which do not enter the normal 

tense system, but which have a specialized form. The IMPER VG 

Is used In Imperatives: 

(sl36> Fire; when ready. 
(sl37) Don ' t drop the baby. 

It consists of a verb In the INFInltlve form, possibly preceded 
by the auxllllary "do 11 or Its negative form "don't". The EN VG 
Is used in EN RSQ CLAUSES, like: 

(s!38) a man forsaken by his friends 
and consists of a past partlcple verb. The ING VG Is made up of 
an ING verb or the verb "being" followed by an EN verb. It Is 
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used In various types of ING clauses: 

(sl39> Being married Is great. 
(sUO) the girl pitting near the wall 

Similarly, the TO VG Is used In TO clauses. In the case of 

conjoined structures, the "to" may be omitted from the second 

clause, as In: 

(sUl) We wanted to stop the war and end repression. 

Such a VG Is marked TODEL. 

We separate those verb groups whose main verb Is M be M from 
the others, as they do not undergo the further choice between 
PASV and ACTV, These correspond to the same features for 
clauses, and are seen In the structure by the fact that a PA5V 
VG contains a form of the auxllllary "be" followed by the main 

verb In the EN form, as In: 

<sH2) The pangr was finished bv the deadline. 
<sU3) He wanted lc b£ kissed by the bride. 

Finally/ any VG can be NEGatlve, either by using a negative 
form of an auxllllary like "don't", "hasn't", or "won't", or by 
Including the word "not". 
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2-3.9 Words 

Our grammar uses a number of separate word classes, each of 
which can be divided Into subclasses by the features assigned to 

Individual words. It was necessary to make arbitrary decisions 
as to whether a distinction between groups of words should be 
represented by different classes or different features within 
the same class. Actually we could have a much more tree-like 
structure of word classes. In which the Ideas of classes and 
features were combined. Since this has not yet been done, we 
will present a list of the different classes In alphabetical 
order, and for each of them give descriptions of the relevant 
features. Many words can be used In more than one class, and 
some classes overlap to a large degree (such as NOUN and CLASF) . 
In our dictionary, we simply list all of the syntactic features 
the word has for all of the classes to which It can belong. 
When the parser parses a word as a member of a certain class. It 
sorts out those features which are applicable. Figure 35 Is a 
list of the word classes and their features, 

ADJ -- Adjective Is one of the constituents of a NG as well as 
being the main part of an AOJG. This class Includes words 



ke "big", "ready 11 , and "strange". The only features a 



re 



Superlative (as In "biggest") and COMPARatlve (as In 
"bigger"). 
ADV — We use the name "adverb" to refer to a whole group of 

words used to modify other words or clauses. It Is sort of a 
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CLASS FEATURES 

ADV ADV ADVAOV LOBJ PLACE PREPAOV TIMW TIM2 VBAO 

BINDER BINDER 

CLASF CLASF 

DET DEF DEM DET INCOM INDEF NEG NONUM NPL NS OFD PART 

QDET QNTFR 

ADJ ADJ COMPAR SUP 

NOUN MASS NOUN NPL NS POSS TIME T I Ml 

NUM NPL NS NUM 

NUMD NUMD NUMDALOHE NUMDAN MUMDAT 

ORO ORD TIMORO 

PREP PLACE PREP NEED2 

PREP2 PREP2 

PRON DEFPOSS NEG NFS NPL NS OBJ POSS PRON REL SUBJ 

PROHREL NPL NS PRONREL 
PROPN NPL NS POSS PROPN 

PRT PRT 

O.ADJ PLACE QADJ 

TPRON NEG NPL NS TPRON 

VB AUX BE DO EN HAVE IMPERF IMF I NG INGOB INGOB2 INT 

ITRNS ITRNSL MODAL MVB NEG PAST PRES QUAX REPOB REPOB2 
SUBTOB SUBTOB2 TOOB TOOB2 T02 TRANS TRANSL TRANSL2 TRANS2 
VB VFS VPL VPRT V3PS WILL 



Figure 35 -- Word Classes and Applicable Features 
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"mixed bag" of words which don't really fit anywhere else. 
The basic classification depends on what Is being modified, 
and has the terms (ADVADV VBAD PREPADV CLAUSEADV). An ADVADV 
Is a word like "very" which modifies other adverbs and 
adjectives. A VBAD modifies verbs, and Includes the class 
of words ending In "-ly" like "quickly" and "easily". A 
PREPADV modifies prepositions, as "directly" In "directly 
above the stove". A CLAUSEAOV Is a constituent of a clause, 
and can be either TIMW or PLACE. A TIMW like "usually", 
"never", "then", or "often" appears as a CLAUSE constituent 
specifying the time. The PLACE ADV "there" can either be an 
adjunct, as In: 

Cslbii) There I saw a miracle, 
or an LOBJ, as In: 

(sU5) Put It there. 

INDER -- Binders are used to "bind" a secondary clause to a 
major clause, as In: 



(slii6) Before" you got there- we left, 
(slU7) I'll go If ! 



go J_f_ you do. 

We do not assign any other features to binders. 
CLASF -- In Section 2.3.5 we discussed the use of CLASF as a 

constituent of a NG. The CLASF Is often another NOUN, but It 

appears In a position like an adjective, as In " bov scout". 
DET -• DETermlners are used as a constituent of a HG, as 

described in 2.3.5. They can have a number of different 

features, as described In the network; 
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A DET can be INDEFInlte, 1 1 ke "a" or "an" or the 
question determiners (QDET) "which", "what", and "how many". 
It can be DEFInlte, like "the" or the DEMonstratlve 
determiners "this", "that", "those", and "these". Or It can 
be a quantifier (QNTFR) like "any", "every", "some", etc. 
Quantifiers can have the feature OFD, Indicating that they 
can be used In an OF NG like: 

(sH8) some of my best friends 
We originally had a separate feature named INCOM Indicating 
whether they could be used In an Incomplete NG like; 

<sU9) Buy somfc. 
but later analysis showed these features were the same. Not 
all quantifiers are OFD -- we cannot say "every of the cats" 
or "Buy every." Quantifiers can also be NEGatlve, like "none" 
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or "no", or can be NQNUM, Indicating that they cannot be used 
with a number, such as "many" or "none" (we can say "any three 
cats" or "no three cats", but not "none three" or "many three"). 
The NG program takes these features Into account In deciding 
what NG constituents to look for. It also has to find agreement 
In number between the DET and the NOUN. A DET can have the 
features "singular" (NS), "plural" <NPL) or MASS (like "some" or 
"no", which can go with MASS nouns like "water"). A DFT can 
have more than one of these -- "the" has all three, while "all" 
Is MASS and NPL, and "a" Is just NS. 

NOUN -- The main constituent of a NG Is Its NOUN. It has a 
feature of number. Identical to that of the DETermlners It must 
match. The word "parsnip" Is NS, "parsnips" Is NPL, and 
"wheat" Is MASS. Some nouns may have more than one of these, 
such as "fish", which Is all three since It can be used In "a 
fish", "three fish", or "Fish Is my favorite food." In 
addition, a NOUN can be POSSesslve, like "parsnip's". 

In order to tell whether a NG Is functioning as a time 
element In a CLAUSE, we need to know whether Its NOUN can refer 
to time. We therefore have two features -- TIME words like 
"day", and "month", as In: 

(sl50) The next dav It started to snow, 
and TIM1 words like "yesterday" and "tomorrow". This 
Illustrates the Interaction between syntax and semantics. A 
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phrase like "the next visit" can be used to Indicate a time, 
since a "visit" Is an event. The actual distinction should 
be the semantic difference between "event" and "non-event" 

The grammar could be easily changed to look at the 
semantic features rather than syntactic features of the NOUN 
In deciding whether It could be the head of a TIME NG. 
NUM. — The class of NUHbers Is large (uncountably Infinite) but 
not very Interesting syntactically. For our purposes we only 
note the features NS (for "one") and NPL (for all the rest). 
In fact, our system does not accept numbers In nurerlc form, 
and has only been caucht to count to ten. 
NUMD -- In complex number specifications, II ke "at least three" 
or "more than a minion", there Is a NUMD. The features they 
can have are (NUMDAN NUMDAS NUMDAT NUMDALONE). NUMDAN words 
such as "more" and "fewer" are used with "than", while NUMDAS 
words such as "few" fit Into the frame "as. ..as", and NUMDATs 
are preceded by "at", as In "at least", and "at most". 
NUMDALONE Indicates that the NUMD can stand alone with the 
number, and includes "exactly" and "approximately". 

ORD — The class of ORDInals Includes the ordinal numbers 

"first", "second", etc., and a few other words which can fit 
Into the position between a determiner and a number, like 
"next", "last", and "only". Notice that Superlative 
ADJectlves can also fill this slot In the NG. 

PREP -- Every PREPG begins with a PREPosltlon, either alone, or 
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as part of a combination such as "on top of". In the 
combination case, the words following the Initial PREP have 
the feature PREP2. A PREP which cannot appear without a 
PREP2 (such as "next" which appears In "next to") are marked 
NEED2. 

PRON » PRONouns can be classified along a number of dimensions, 
and we can think of a large mul tl-dlmensional table with most 
of Its positions filled. They have number features (MS NPL 
NFS) (note that Instead of the more usual division Into 
first, second, and third person, singular and plural, we have 
used a reduced one In which classes with the same syntactic 
behavior are lumped together). They can be POSSesslve, such 
as "your" or "my", or POSSDEF, like "yours" or "mine". Some 
of the personal pronouns distinguish between a SUBJect form 
like "I" and an OBJect form like "me", there are also 
special classes like DEMonstratlve ("this" and "that") and 
PRONREL — the pronouns used In relative clauses, such as 
"who", "which", and "that". Those which can be used as a 
question element, such as "which" and "who" are marked QUEST. 

PROPN -- Proper nouns Include single words like "Carol", or 
Phrases such as "The American Legion" which could be parsed, 
but are interpreted as representing a particular object 
(physical or abstract). A PROPN can be NPL or NS, and Is 
assumed to be NS unless defined otherwise. 

PRT -- In Section 2.3.ii, we discussed clauses which use a 
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combination of a "particle" and a verb, like "pick up" or 
"knock out". The second word of these Is a PRT. 

QAOJ -- One class of QUESTION CLAUSE uses a QAOJ such as 

"where", "when", or "how" as Its question element. They can 
also be used In various kinds of relative clauses, as 
explained In Section 2.3.3. 

TPRON — There Is a small class of words made up of a quantifier 
and the suffix "-thing" which enter Into a special type of NG 
construction like "anything green". This Is not an 
abbreviation for a quantifier followed by a noun, since the 
NG "any block green" would have the same structure but Is not 

grammatical . 
VB -- The verb has the most complex network of features of any 
v/ord In our grammar. They describe Its tense, transitivity, 
number, and use, as well as marking special verbs like "be". 
The network Is In Figure J7. 

Verbs are divided Into AUXIlllarles and others 
(unmarked). AUXIlllarles are the "helping verbs" which 
combine with others In complex VG structures. They can have 
special NEGatlve forms, like "can't", or can appear standing 
alone at the beginning of a QUESTION, In which case they have 

the function QAUX, as In: 

(slSl) Wl 1 I I ever finish? 
The auxiliaries Include "be", "do", "have", "will", and the 
MODALs like "could" "can", and "must". Separate features are 
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used for these as they are critical In determining the 
structure of a VG. An AUX can choose from the system of 
person and number, distinguishing "third-person singular" 
(V3PS) as In "Is", "plural", as In "have", or "first 
singular" (VFS), used only for "am". 

Non-auxllllary verbs can be VPRT, which combine with a 
PRT, and they have a whole cluster of transitivity features. 
In Section 2.3.". we described the different transitivity 
features of the CLAUSE, and these are controlled by the verb. 
Ke therefore have the features (TRANS ITRNS TRANS2 TRANSL 
ITRNSL INTJ In addition, the verb can control what types of 
RSNG CLAUSE can serve as Its various objects. The feature 
names combine the type of CLAUSE (ING TO REPORT SUBTO SUBING) 
with either -OB or -0B2, to get a product set of features 

like SUBTOB and ING0B2. 

For example, the verb "want" has the features TOOB and 

SUBTOB, but not INGOB, REPOB, etc. since "I want to go." and 

"I want you to go." are grammatical, but "I want going. , I 

want that you go.", etc. are not. 

Finally, all of these kinds of verbs can be In various 
forms such as ING ("breaking"), EN ("broken"), INFInltlve 
("break), PAST ("broke"), and PRESent ("breaks"). The 
network does not Illustrate all of the relations, as some 
types (like MODAL) do not make all of these choices. 
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2.3.10 Conjunction 

One of the most complex parts of English Is the system of 
conjunction. This section presents a simplified version which 
has been Implemented using the special Interrupt feature of 
PROGRAMMAR (see Section 2.1*. 2 for details). This makes the 
parsing particularly simple. 

The basic concept Is that any unit In a sentence can be 
replaced by a COMPOUND unit of the same type. In the sentence: 

(sl52) I baked a. chocolate cake, three pies. , and some 
hashish brownies. 

the object Is a COMPOUND NG with three components. There can be 
a compound ADJ, as In: 

(sl53) a red , o_£ vel lpw . flag 
or a phrase can be ambiguous, as In: 
Csl5i*) blacK cats aj}d. horses 

This can be Interpreted as having either a COMPOUND NG, 
composed of the NGs "black cats" and "horses", or a single NG 
with a COMPOUND NOUN, "cats and horses". 

The features of a COMPOUND unit are determined by Its 
components and by the type of conjunction. The conjunction 
features are from the following network: 
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The first choice Is the actual conjunction used. The 
feature BOTH Indicates a word at the beginning of a COMPOUND 

structure, as In: 

(sl55) both you and your family 
the specific word depends on the conjunction — "both" with 
"and", "either" with "or, and "neither" with "nor". The 
features BOTH and NOR combine in: 

(sl56) neither you nor I 

A conjoined structure can be made up of two elements with a 
conjunction (as In the previous three examples), or a LIST 
connected with commas and a conjunction before the last element 
(as In sl52>, or It can be a list connected with conjunctions la 
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LISTA), as In: 

(sl57) cabbages an^ klncs and sealing wax and things 

Every constituent but the first Is narked with the feature 
COMPONENT. The COMPOUND unit also takes on features from Its 
constl tuents. It may have features such as number and tense, 
relevant to Its syntactic function. For example, a COMPOUND NG 
with the feature AND must be plural (HPL), while one with the 
feature OR will have the number feature of Its Initial 
constl tuent. 

Finally, there can be "forked" structures like: 

(sl58) Ha dropped the watermelon and ran. 
Notice that this cannot be viewed In the same way, since 
"dropped the watermelon" Is not a single constituent of any 
unit. This sentence Is viewed as a conjunction of two clauses, 
sharing the subject "we'\ The second clause Is marked with the 
feature SUBJFORK to Indicate this. Similarly, the subject and 
verb can be shared, as In: 

(sl59) He gave Greg the apple and Gloria the pear. 
The clause "Gloria the pear" Is marked as VBFORK and SUBJFORK* 
The only constituents relative to a fork are those necessary for 
the clause structure. In; 

<sl6Q) Thev easily saw the trouble and cheerfully fixed It. 
the parser does not determine whether the "easily" applies to 
both constituents, but leaves this decision to semantics. This 
Is In keeping with our orientation of syntax to surface 
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structure and the belief that much of "deep structure" Is more 
the province of the semantic analyzer. 
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JLiai FQl)gw)nfi ihfi Parser In Operation 

Let us follow the parser through two examples to see how 
the grammar Is used In practice* We will not actually watch all 
of the details/ or deal with the way semantic programs are 
Intermixed with the grammar. Instead we will follow a somewhat 
reduced version, to get a feeling for the way the grammar works, 
and the way It Interacts with the different features described 
above. He have chosen one very simple sentence, and another 
which Is difficult enough to exercise some of the more complex 
features of the grammar. The first sentence Is the first 
sentence of our sample dialog (Section 1.3): 
Pick up a big red block. 

The system begins trying to parse a sentence, which, as 
explained above, means looking for a MAJOR CLAUSE. It activates 
the grammar by calling (PARSE CLAUSE MAJOR). Since CLAUSE Is 
one of our units, there Is a program defined for It, The CLAUSE 
program Is called with an Initial feature list of (CLAUSE 
MAJOR). 

The CLAUSE program looks at the first word. In order to 
decide what unit the CLAUSE begins with. If It sees an adverb. 
It assumes the sentence begins with a single-word modifier. If 
It sees a PREPosltlon, It looks for an Initial PREPG. If It 
sees a BINDER, It calls the CLAUSE program to look for a BOUND 
CLAUSE. In English (and possibly all languages) the first word 
of a construction often gives a very good clue as to what that 
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construction will be. We have "advance notice" of what 
structures to look for, and this makes parsing much easier. Our 
grammar tries to make as much use as possible of these natural 
"signals". In this case, the Initial word Is a verb, and this 
Indicates that we may have an IMPERATIVE CLAUSE. The program 
calls (PARSE VG IMPER) to start the VG program with the Initial 
VG feature list (VG IMPER), looking for a VG of the right type. 
The VG program checks this Initial feature list, and sees that 
It Is looking for an IMPERatlve VG. This must either begin with 
some form of the verb "do", or with the main verb Itself. Since 
the next word Is not "do". It calls (PARSE V8 INF (MVB)). This 
Is a different kind of call to PARSE, since VB Is not a unit we 
have defined. It Is a word class, and the call says to check 
the next word In the Input (In this case still the first word) 
to see whether It Is Indeed the INFInltlve form of a VerB. If 
so. It Is to be attached to the parsing tree, and given the 
additional feature HV8 (main verb). The current structure can 
be dl a g rammed: 



(CLAUSE MAJOR) (pick...) 

(VG IMPER)" (pick...) 

(VB MVB INF TRANS VPRT) pick 

Figure 39 -- Syntactic Structure 1 



We use several conventions for diagramming syntactic 
structures. Rather than using a tree format (which quickly 
grows off of the page), we use a format more like a traditional 
outline, with the sub-phrases of any phrase Indicated 
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Immediately below It and Indented. We use the symbol "*" to 
Indicate the program which Is currently active, and show the 
actual words of each piece of the sentence to the right of the 
outline, putting units larger than WORD In parentheses. A 
series of dots In the English words to the right of a unit 
Indicates that the program for that unit has not yet finished. 
Figure 39 shows that we have a CLAUSE, with a constituent 
which Is a VG, and that the VG proeram Is active. The VG so far 
consists of only a VB. Notice that some new properties have 
appeared on the list for VB. We have not mentioned TRANS or 
VPRT. These came from the definition of the word "pick" when we 
called the function PARSE for a word (see section 2.U.k for 
details). 

Ordinarily the VG program checks for various kinds of tense 
and number, but In the special case of an IMPER VB, It returns 
Immediately after finding the verb. We will see other cases In 
the next example. 

When the VG program succeeds, CLAUSE takes over again. 
Since It has found the right kind of VG for an IMPERatlve 
CLAUSE, It puts the feature IMPER on the CLAUSE feature list. 
It then checks to see whether the MVB has the feature VPRT, 
Indicating It Is a special kind of verb which takes a particle. 
It discovers that "pick" is such a verb, and next checks to see 
If the next word Is a PRT, which It is. It then checks In the 
dictionary to see If the combination "pick up" Is defined, and 
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when It discovers this Is true. It calls (PARSE PRT) to add "up" 
to the parsing tree. Notice that we might have let the VG 
program do the work of looking for a PRT, but It would have run 
Into difficulties with sentences like "Pick the red block up." 
In which the PRT Is displaced. By letting the CLAUSE program do 
the looking, the problem Is simplified. 

As soon as It has parsed the PRT, the CLAUSE program marks 
the feature PRT on Its own feature list. It then looks at the 
dictionary entry for "pick up" to see what transitivity features 
are there. It Is TRANSltlve, which Indicates that we should 
look for one object -- 0BJ1. The dictionary entry does not 
Indicate that this Is a verb which can take special types of 
RSNG clauses as objects, so the object must be either a NG or a 
WHRS clause (which can appear wherever a NG can). If the object 
were a WHRS clause. It would begin with a relative pronoun, like 
"Pick uo what I told you to." Since the next word Is "a", this 
Is not the case, so the CLAUSE program looks for an object by 
calling (PARSE NG OBJ 0BJ1), asking the NG program to find a NG 
which can serve as an 0BJ1. The structure Is now: 



(CLAUSE MAJOR IHPER PRT) (pick up...) 

(VG IHPER ) (pick) 

(VB MVB INF TRANS PRT) pick 

(PRT) "P , 

(NG OBJ 0BJ1)« (...) 

Figure UQ -- Syntactic Structure 2 



The NG program Is started and notices that the upcoming 
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word Is a DET, "a". It calls (PARSE DET) to add ft to the 
parsing tree, then uses the function TRNSF to transfer relevant 
features from the DET to the entire NG. It Is Interested In the 
type of determination (DEF vs. I NDEF vs. QNTFR), and the number 
(NS vs. NPL). It also adds the feature DET to the NO to 
Indicate that It has a determiner. The feature list for the NG 
Is now: 

(NG OBJ OBJ1 DET INDEF NS) 
since "a" Is a singular Indefinite determiner. The NG program 
then notices the feature INDEF, and decides not to look for a 
number or an ordinal (we can't say "a next three blocks"), or 
for the OF construction ("a of them" Is Impossible). It goes on 
Immediately to look for an ADJectlve by calling (PARSE ADJ). 
When this succeeds with the next word "big", a simple program 
loop returns to the (PARSE AOJ) statement, which succeeds again 
with "red", on the next trip It falls, and sends the program on 
to look for a classifier, since "block" Isn't an ADJ. But 
"block" Isn't a CLASF either In our dictionary, so the NG 
program goes on to look for a NOUN, by calling (PARSE NOUN). 
This succeeds with the NOUN "block", which Is singular, and the 
program checks to see If It agrees with the number features 
already present from the determiner (to eliminate Illegal 
combinations like "these boy"). In this case, both are singular 
(NS), so the program Is satisfied. Ordinarily It would go on to 
look for qualifiers, but In this case there Is nothing left In 
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the sentence. Remember that we have an especially easy way of 
Indicating In a PROGRAHMAR program what action should be taken 
at any point If the sentence runs out. We can do It by simply 
putting a third direction In any branch statement. In this 
case, since we have found all of the basic constituents we need 
for a NG, the "third branch" tells us that the NG program should 
return success. If we had run out after the determiner. It 
would have sent us to check for an INCOMplete NG, while If we 
had run out after an ADJ It would have entered a backup program 
which would check to see whether It had misinterpreted a NOVN as 

an ADJ. 

In this case, the NG program returns, and the CLAUSE 
program similarly notices that the sentence has ended. Since a 
TRANS verb needs only one object, and that object has been 
found, the CLAUSE program marks the feature TRANS, and returns, 
ending the parsing. In actual use, a semantic program would be 
called here to understand and execute the command -- In fact, 
semantic programs would have been called at various points 
throughout the process. The final result looks like: 
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(CLAUSE MAJOR IMPER PRT TRANS) (pick up 3 big red block) 

(VG IMPER) (pick) 

(VB HVB INF TRANS VPRT) pick 



(PRT) 



up 



(NG OBJ 0BJ1 DET INDEF NS) (a big red block) 

(DET INOEF NS) a 

(ADJ) big 

(ADJ) rP d 

(NOUN NS) block. 

Figure <il -- Syntactic Structure 3 
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Now let us take a more complex sentence/ like; 

How many blocks are supported by the cube which I 
wanted you to pick up? 

We will not go Into as much detail/ but will emphasize the 
new features exhibited by this example. First, the parser 
recognl2es that this sentence Is a question by Its punctuation. 
It ends with a question mark. This "chatting" Is not really 
necessary, and In the future the grammar will be revised to look 
for the other signals of a question (for example, beginning with 
a determiner like "how many" or "which"). 

In any event, the feature QUESTION Is noted, and the 
program must decide what type of question It Is. It checks to 
see If the CLAUSE begins with a QADJ like "why", "where 11 , etc. 
or with a PREPosltlon which might begin a PREPG QUEST {like "In 
what year. . .") . 

All of these things fall In our example, so It decides the 
CLAUSE must have a NG as Its question element, (called NGQ), 
marks this feature, and calls (PARSE NG QUEST). The NO program 
starts out by noticing QUEST on Its Initial feature list, and 
looking for a question determiner (DET QDET) . Since there ere 
only three of these ("which", "what", and "how many"), the 
program checks for them explicitly, parsing "how" as a QDET, and 
then celling (PARSE NIL MANY), to add the word "many" to the 
parsing tree, without worrying about Its features. (The call 
(PARSE NIL X) checks to see If the next word Is actually the 
word "x")K 
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Since a determiner has been found. Its properties are added 
to the NG feature list, (In this case, (NUMDET INDEF NPL)), and 
the NG program goes on with Its normal business, looking for 
adjectives, classifiers, and a noun. It finds only the NOUN 
"blocks 11 with the features (NOUN NPL). The word "block" appears 
In the dictionary with the feature NS, but the Input program 
which recognized the plural ending changed NS to NPL for the 
form "blocks". Agreement Is checked between the NOUN and the 
rest of the NG, and since "how many" added the feature NPL, all 
Is well. This time, there Is more of the sentence left/ so the 
NG program continues, looking for a qualifier. It checks to 
see If the next word Is a PREPosItlon (as In "blocks jm the 
table), a relative word ("blocks whl<;h . . . ) , a past participle 
("blocks suppported by.,.), an I NG verb ("blocks sitting on,,,) 
a comparative adjective ("blocks bigger than...) or the word 
"as" ("blocks ££. big as...). If any of these are true. It tries 
to parse the appropriate qual I f y Ing phrase. If not. It tries to 
find an RSQ CLAUSE ("blocks xh£. block supports ) , In this case, 
all of these fall since the next word Is "are", so the NG 
program decides It will find no qualifiers, and returns what It 
a I ready has . Thl s gives us: 



(CLAUSE MAJOR QUESTION NGQ)* (how many blocks...) 

(NQ QUEST DET NUMDET NPL INDEF) (how many blocks) 

(DET QOET NPL INDEF) how 

() many 

(NOUN NPL) blocks 

Figure U2 — Syntactic Structure U 
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Next the CLAUSE program wants a VG, so It calls (PARSE VG 
NAUXK The feature NAUX Indicates that we want a VG which does 
not consist of only an AUXIUlary verb, like "be" or "have". If 
we saw such a VG, It would indicate a structure like "How many 
blocks are the boxes supporting?", in which the question NG Is 
the object of the CLAUSE. We are Interested In first checking 
for the case where the Question NG Is the subject of the CLAUSE. 

The VG program Is designed to deal with combinations of 
auxllliary verbs like "had been golne to be..." and notes that 
the first verb Is a form of "he". It calls (PARSE VB AUX BE), 
assuming that "are" Is an auxllliary rather than the main verb 
of the sentence (If this turns out wrong, there Is backup). It 
transfers the Initial tense and person features from this verb 
to the entire VG (The English VG always uses the leading verb 
for these features, as In "He has been.,.", where It Is "has" 
which agrees with "he") In this case "are" Is plural (VPL) and 
present tense (PRES). 

When "be" is used as an auxllliary. It Is followed by a 
verb In either the ING or the EN form. Since "supported" is an 
EN form (and was marked that way by the Input program). The VG 
program calls (PARSE VB EN (MVB)), marking "supported" as the 
main verb of the clause. The use of a "be" followed by an EN 
form Indicates a PASV VG, so the feature PASV Is marked, and the 
VG program Is ready to check agreement. Notice that so far we 
haven't found a SUBJect for this clause, since the QUESTIon NG 
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might have been an object, as In " frJow mm blocks does the box 
support? 11 However the VG program Is aware of this, and realizes 
that Instead of checking agreement with the constituent marked 
SUBJ, It must use the one marked QUEST. It uses PROGRAMMER'S 
pointer-moving functions to find this const I tuent, and notes 
that It Is NPL, which agrees with VPL. VG therefore Is happy 
and returns Its value* We now have: 



(CLAUSE MAJOR QUESTION NGQ)* (how many blocks are supported...) 

(NG QUEST DET NUMDET NPL INDEF) (how many blocks) 

(DET QOET NPL INDEF) how 

() many 

(NOUN NPL) blocks 

(VG NAUX VPL PASV (PRES)) (are supported) 

(VB AUX BE PRES VPL) &re 

(VB MVB EN TRANS) supported 

Figure t*3 -- Syntactic Structure 5 



The CLAUSE program resumes, and marks the feature SUBJQ, 
since It found the right kind of VG to Indicate that the NG "how 
many blocks* 1 Is Indeed the subject. It next checks to see If we 
have a PRT situation as we did In our first example. We don't, 
so It next checks to see If the VG Is PASV, and marks the clause 
with the feature PASV. This Indicates that there will be no 
objects, but there might be an AGENT phrase. It checks that the 
next word Is "by", and calls (PARSE PREPG AGENT). 

The PREPG program Is fairly simple ~ It first calls (PARSE 
PREP), then (PARSE NG OBJ PREPOBJK The word "by" Is a PREP, so 
the first call succeeds and NG Is called and operates as 
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described before, finding the DET "the" and the NOUN "cube", and 
checking the appropriate number features. In this case, "the" 
Is both NPL and NS, while "cube" Is only NS, so after checking 
the NG has only the feature NS. 

The NG program next looks for qualifiers, as described 
above, and this time It succeeds. The word "which" signals the 
presence of a RSQ WHRS CLAUSE modifying "cube". The NG prop.ram 
therefore calls (PARSE CLAUSE RSQ WHRS). The parsing tree now 
looks like; 



(CLAUSE MAJOR QUESTION NGQ SUBJQ PASV) 

(how many blocks are supported by the cube...) 

(NG QUEST DET NUMDET NPL INDEF) (bow many blocks) 

(DET QDET NPL INDEF) how 

() many 

(NOUN NPL) blocks 

(VG NAUX VPL PASV (PRES)) (are supported) 

(VB AUX BE PRES VPL) are 

(VB MVB EN TRANS) supported 

(PREPG AGENT) (by the cube...) 

(PREP) by 

(NG OBJ PREPOBJ DET DEF NS) (the cube...) 

(DET DEF NPL NS) the 

(NOUN NS) cube 

(CLAUSE RSQ WHRS)- (...) 

Figure kk -- Syntactic Structure 6 



The CLAUSE program Is Irrmedl latel y dispatched by the feature 
WHRS to look for a RELWD. It finds "which", and narks Itself as 
NGREL. It then goes on to look for a (VG NAUX) just as our 

QUESTION NGQ clause did above. Remember that WH- questions and 
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WHRS clauses share a great deal of the network, and they share 
much of the program as well. This time the VG program falls, 
since the next word Is "I", so the CLAUSF program decides that 
the clause "which I..." Is not a 5UBJRGL. It adds the temporary 
feature N5UBREL, Indicating this negative knowledge, but not 
deciding yet just what we do have. It then goes to the point In 
the normal clause program which starts looking for the major 
constituents of the clause — subject, verb, etc. We coll 
(PARSE NG SUBJ) and succeed with the PRONG M I M . We then look 
for a VG, and find "wanted". In this case, since the verb Is 
PAST tense. It doesn't need to agree with the subject (only the 
tenses beginning with PRES show agreement). The feature NAGR 
marks the non-appl IcabI 1 1 ty of agreement. The parsing tree from 
the WHRS node on down Is now: 



(CLAUSE RSQ WHRS NGREL NSUBREL)* (which I wanted...) 

(RELWD) which 

(NG SUBO PRONG NFS) (1) 
(PRON NFS) I 

(VG NAGR (PAST)) (wanted) 

(VB HVB PAST TRANS TOOBJ SUBTOBJ) wanted 

Figure ^5 -- Syntactic Structure 7 



The CLAUSE program notes that the HVB Is TRANS and begins 
to look for an 0BJ1. This time It also notes that the verb Is a 
TOOflJ and a SUBTOBJ (It can take a TO clause as an object, as In 
"I wanted ifi go. ", or a SUBTO, as In "I wanted you tp gfl," Since 
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the next word Isn't "to". It decides to look for a SUBTO clause, 
call Ins (PARSE CLAUSE RSNG OBJ 0BJ1 SUBTO). In fact, this 
checking for different kinds of RSNG clauses Is done by a small 
function named PARSEREL, which looks at the features of the MVB, 
and calls the appropriate clauses. PARSEREL Is used at several 
points In the grammar, and one of main advantages of writing 
grammars as programs Is that we can write such auxllllary 
programs (whether In PROGRAHMAR or LISP) to make full use of 
regular I ties In the syntax. 

The CLAUSE program Is called recursively to look for the 

SU3T0 clause "you to pick up", If finds the subject "you 11 , and 
calls (PARSE VG TO) since It needs a verb group of the "to" 
type. The VG program notices this feature and finds the 
appropriate VG (which Is again NAGR) . The PRT mechanism 
operates as described In the first example, and the bottom of 
our structure now looks like: 
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(CLAUSE RSQ WHRS NGREL NSUBREL) (which I wanted you to pick ud) 

(RELWD) which 

IfJG SUBJ PRONG NFS) (I) 

(PRON NFS) I 

(VG NAGR (PAST)) (wanted) 

(VB HVB PAST TRANS TOOBJ SUBTOBJ) wanted 

(CLAUSE RSNG SUBTO OBJ OBJ1 PRT)* (you to pick up) 

(NG SUBJ PRONG NPL) (you) 

(PRON NPL) you 

(VG TO NAGR) (to pick) 

() to 

(VB MVB INF TRANS VPRT) pick 

(PRT) up 

Figure it6 -- Syntactic Structure 8 



Notice that we have a transitive verb-particle combination, 
"pick up", with no object, and no words left In the sentence. 
Ordinarily this would cause the program to start backtracking — 
checking to see If the MVB Is also Intransitive, or If there Is 
some way to reparse the clause. However we are In the special 
circumstance of an embedded clause which Is somewhere on the 
parsing tree below a relative clause with an "unattached" 
relative. In the clause "which I told you to pick up",'IMs 
the subject, and the CLAUSE "you to pick up" Is the object. The 
"which" has not been related to anything. There is a small 
program named UPCHECK which uses PROGRAMMAR's ability to look 
around on the parsing tree. It looks for this special 
situation, and when it finds It does three things: 1) Mark the 
current clause as UPREL, and the appropriate type of UPREL for 
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the thing It Is missing (In this case 0BJ1UPREL). 2) Remove 
the feature NSUBREL from the clause with the unattached relative 
3) Replace It with DOWNREL to indicate that the relative has 
been found below. This can all be done with simple programs 
using the basic PROGRAMMAR primitives for moving around the 
tree (see section 2. fa. ID) and manipulating features at nodes 
(see 2, fa. 11). The Information which Is left In the parsing tree 
Is sufficient for the semantic routines to figure out the exact 
relationships between the various pieces Involved. 

In this example, once the CLAUSE "to pick up" has been 
marked as 0BJ1UPREL, It has enough objects, and can return 
success since the end of the sentence has arrived. The CLAUSE 
"which I want you to pick up" has an object, and has Its 
relative pronoun matched to something, so It also succeeds, as 
does the NG "the cube.., 11 , the PREPG "by the cube..", and the 
MAJOR CLAUSE. The final result Is shown In Figure b7. 

Even In this fairly length/ description, we have left out 
much of what was going on. For example we have not mentioned 
all of the places where the CLAUSE program checked for adverbs 
(like "usually" or "quickly"), or the VG program looked for 
"not", etc. These are all "quick" checks, since there Is a 
PROGRAMMAR command which checks the features of the next word. 
In following the actual programs, the course of the process 
would be exactly as described, without backups or other attempts 
to parse major structures. 
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(CLAUSE MAJOR QUESTION NGQ SUBJQ PASV AGENT) 

(NG QUEST DET NUMOET NPL INDEF) (how many blocks) 

(DET QDET NPL INDEF) how 

( ) many 

(NOUN NPL) blocks 

(VG NAUX VPL PASV (PRES)) (are supported) 

(VB AUX BE PRES VPL) are 

(VB HVB EN TRANS) supported 

(PREPG AGENT) (by the cube which I wanted you to pick up) 

(PREP) by 

(NG OBJ PREPOBJ DET DEF NS) 

(the cube which I wanted you to pick up) 

(DET DEF NPL NS) the 

(NOUN NS) cube 

(CLAUSE RSQ WHRS NGREL DOWNREL TRANS) 

(which I wanted you to pick up) 

(RELWD) which 

(NG SUBJ PRONG NFS) (I) 

(PRON NFS) I 

(VG NAGR (PAST)) (wanted) 

(VB MVB PAST TRANS TOOBJ SUBTOBJ) wanted 

(CLAUSE RSNG SUBTO OBJ OBJ1 PRT 

TRANS UPREL OBJ1UPREL) (you to pick up) 

(NG SUBJ PRONG NPL) (you) 

(PRON NPL) you 

(VG TO NAGR) (to pick) 

() to 

(VB MVB IMF TRANS VPRT) pick 

(PRT) up 



Figure U7 - Syntactic Structure 9 
"How many blocks are supported by the cube 
which I wanted you to pick up?" 
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This may seen like a quite complex process and complex 
grammar, compared to other systems, or even our ov/n examples In 
Sect Ion 2,2. This Is because language I s I ndeed a hlghl y 
complex phenomenon. We have tried to handle a great deal more 
of the complexity of English than any of the previous language- 
understanding systems. It Is only due to the fact that 
PROGRAMMAR gives us an easy framework In which to include 
complexity that It was at all possible to Include such a 
detailed grammar as only one part of a project carried out by e 
single person In less than two years. 
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Analysis flf Word Endings 

This section describes the "spelling rules" used by the 
program In recognizing Inflectional endings of words. For 
spoken language, these would be called the "morphophonemlc" 
rules, but since we deal with written language, they are 
"morpho-graphemlc." 

These rules enable a reader to recognize that, for example, 
"pleasing" Is a form of "please", while "beating" Is a form of 
"beat". There Is a structure of conventions for doubling 
consonants, dropping "e", changing "I" to "y", etc. when adding 
endings, and a corresponding set for removing them, 

A word like "running 11 need not have separate entry In the 
dictionary, since It Is a regular Inflected form of "run". The 
program can use an Interpretive procedure to discover the 
underlying form and attach the appropriate syntactic features 
for the Inflection. 

In designing a formalism for these rules. It seems most 
natural to express them as a program for interpretation. The 
flow chart In Figure ft8 Is designed to handle a number of 
Inflectional endings — "-n't" for negative, "-'s" and "- 1 " for 
possessive, n -s" and Its various forms for plural nouns and 
singular third-person verbs, "-ling", "*ed", and "-en" verb 

forms, the superlative "-est" and comparative "-er", and the 
adverbial "-ly". 

As the flowchart shows, these endings share many aspects of 
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morphographemlc structure, and the program representation Is 
able to capture these generalities as well as detailing those 
aspects peculiar to each ending. It Is not a complete 
description, but covers a wide variety of words, and could 
easily be expanded to treat more special cases. 

The description uses a few simple notations In addition to 
normal flowchart conventions, "No 11 and "ye$ n answers to branch 
conditions are represented by single and double arrows 
respectively. The function "cutoff" Indicates what Is to be 
cut off of the end of the word. The ordinals "1st", "2nd", etc. 
count letters from the end of the word backwards. Ignoring those 
which have been cut off. Several classes of letters are 
relevant to endings -- VOWEL Includes (A E I U Y), LIQUID Is 
(L R S V Z), and NOEND Is (CGS VZ). The label "TRY" 
represents the part of the program which tries looking up the 
supposed root In the dictionary. It uses whatever letters have 
not been "cut off". At some places, the program tries an 
Interpretation, then If that falls, carries out a different 
anal ys I s and tries again. 

The program can correctly analyze such roots as: bashes - 
bash, bathes - bathe, leaning - lean, leaving - leave, dented - 
dent, danced - dance, dogs - dog, kisses - kiss, curved - curve, 
curled - curl, rotting - rot, rolling - roll, played * play, 
piled - ply, realest - real, palest - pale, knives - knife, 
prettily - pretty, nobly - noble, etc. 
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It Is Important to note that exceptions do not need to be 
explicitly Included In the analysis program. The word "was" Is 
directly In the dictlonary / and the rules will therefore never 
be applied In trying to analyze It. The distinction between 
lexical Idlosyncracles and generalities Is empirical. A 
productive rule like the one relating "sane 11 and "sing", or 
"rang 11 and "ring" might well be Included In a more complete 
program, while less productive ones (such as relating "will" to 
"won't") will be left as separate dictionary entries. 

It Is tempting to see this program as a simple finite state 
machine, but this Is not the best formalism for several reasons. 
First, the tests which can be done to a word In deciding on a 
transition are not. In general, simple checks of the next Input 
letter. Whether a certain analysis Is possible may depend, for 
example, on how many syllables there are In the word, or on some 
complex phonological calculation Involving vowel shifts. 
Semantic and syntactic Information should be applied as well In 
many cases. 

The morphographemic analysis alone Is sufficient to do a 
great deal of the Interpretation of English endings. In fact, 
some systems UThorne 1969>> use It to avoid having a dictionary 
of open class words. The Inflection of the words In the Input 
determines their syntactic class possibilities. Of course, 
without a dictionary, they cannot attempt to deal with the 
meaning of words, beyond trying to match them with other 
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occurences of Che same root. 

People rarely operate at this level of Ignorance, They use 
their lexical knowledge to realize that "under" Is not a 
comparative form of some adjective "und", and that "bely" Is not 
an adverbial form of "be". This knowledge can readily be 
Integrated Into the Interpretive program. Once a possible 
morphographemlc analysis Is found, the hypothetical root can be 
checked In the dictionary. If It Is not there, the program can 
try another analysis. Notice that In the flow chart, words 
ending In a double LIQUID, followed by an Inflection, are first 
tried In the form with the doubled consonant (as In "rolling" - 

"roll"), then If that falls, another try Is made with the single 
consonant ("patrolled" - "patrol"). 

If the root Is found In the dlctloary. Its syntactic class 
Is checked to see If the ending Is appropriate (e.g. a "-est" 
ending can apply only to an ADJ). The list of syntactic 
features Is changed to take the ending Into account. For 
example, the ending "-5" changes the feature HS (singular noun) 
to NPL (plural noun) and the feature INF (Infinitive verb) to 
V3PS (third-person singular verb). A word which fits Into more 
than one category may have more than one set of features 
changed. In determining all of the possible Interpretations. 

The problems Involved In analyzing more complicated 
endings, such as the "-tlon", "-ment", and "-er M endings of 
noml nail zat Ions will be discussed In a later paper. 
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2.1* Programming Details 

2^-1 Operation fi£ ±h£ SVSt?™ 

Since the grammar Is Itself a program, there Is not much 
overhead mechanism needed for the basic operation of the parser. 
Instead, the system consists mostly of special functions to be 
used by the grammar. The system maintains a number of global 
variables, and keeps track of the parsing tree as It Is built by 
the main function, PARSE. When the function PARSE Is called for 
a UNIT which has been defined as a PROGRAMMAR program, the 
system collects Information about the currently active node, and 
saves it on a pushdown list. It then sets up the necessary 
variables to establish a new active node, and passes control to 
the PROGRAMMAR program for the appropriate unit. If this 
program succeeds, the system att-aches the new node to the tree, 
and returns control to the node on the top of the POL. If It 
falls. It restores the tree to Its state before the program was 
called, then returns control. A PROGRAMMAR program Is actually 
converted by a simple compiler to a LISP program and run In that 
form. The variables and functions available for writing 
PROGRAMMAR programs are described In the rest of part 2.fc. In 
order to make these details more Independent of our detailed 
grammar of English, we will continue to use a simplified grammar 
whenever possible. We use the hypothetical grammar begun In 
2.2, and try to use full length feature names for easier 
understanding. 
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When the function PARSE Is -called with a first argument 
which has not been defined as a PROGRAMMAR program, It checks to 
see whether the next word has all of the features listed In the 
arguments. If so. It forms a new node pointing to that word, 
with a list of features which Is the Intersection of the list of 
features for that word with the allowable features for the word 
class Indicated by the first argument of the call. For example, 
the word "blocks 11 will have the possibility of being either a 
plural noun or a thl rd-person-s I ngular present-tense verb. 
Therefore, before any parsing It will have the features (NOUN 
VERB N-PL VB-3PS TRANSITIVE PRESENT). If the expression (PARSE 
VERB TRANSITIVE) Is evaluated when "blocks" Is the next word In 
the sentence to be parsed, the feature list of the resulting 
node will be the Intersection of this combined list with the 
list of allowable features for the word-class VERB, If we have 
defined: 

(DEFPROP VERB (VERB INTRANSITIVE TRANSITIVE PRESENT PAST 
VB-3PS VB-Pl) ELIM), 

the new feature list will b« (VERB TRANSITIVE PRESENT VB- 
3PS). (ELIM Is simply a property Indicator chosen to Indicate 
this list which ELIMInates features). Thus, even though words 
may have more than one part of speech, when they appear In the 
parsing tree, they will exhibit only those features relevant to 
their actual use In the sentence. 
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2-^2 Special Words 

Some words must be handled In a very special way In the 
grammar. The most prevalent are conjunctions, such as "and" and 
"but". When one of these Is encountered, the normal process Is 
Interrupted and a special program Is called to decide what steps 
should be taken In the parsing. This Is done by giving these 
words the grammatical features SPEC or SPECL. Whenever the 
function PARSE Is evaluated, before returning It checks the next 
word In the sentence to see If It has the feature SPEC. If so, 
the SPEC property on the property list of that word Indicates a 
function to be evaluated before parsing continues. This program 
can in turn call PROGRAMHAR programs and make an arbitrary 
number of changes to the parsing tree before returning control 
to the normal parsing procedure. SPECL has the same effect, but 
Is checked for when the function PARSE Is called, rather than 
before It returns. Various other special variables and 
functions allow these programs to control the course of the 
parsing process after they have been evaluated. By using these 
special words. It is possible to write amazingly simple and 
efficient programs for some of the aspects of grammar which 
cause the greatest difficulty. This Is possible because the 
general form of the grammar Is a program. 

For example, "and" can be defined as a program which Is 
diagrammed: 
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Parse a unit of the same type 
as the currently active node * 



i 



-^Return fa 1 lure 



Replace the node with a new node 
combining the old one and the one 
you have just found 



* 



Return success 

Figure kO « Conjunction Program 



For example/ given the sentence "The giraffe ate the apples 
and peaches." the program would first encounter "and 11 after 
parsing the NOUN apples* It would then try to parse a second 
NOUN, and would succeed, resulting in the structure: 



SENTENCE 



NP 

\ 

DETERMINER NOUN VERB 

1 I 

giraffe ate 




NOUN 






DETERMINER 

JL 



NOUN 



NOUN 

apples and peaches 



Figure 50 -- Conjoined Noun Structure 



If we had the sentence, "The giraffe ate the apples and 
drank the vodka.' 1 the parser would first try the same thing. 
However, "drank" Is not a NOUN, so the AND program would fall 
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and the NOUN "apples" would be returned unchanged. This would 
cause the NP "the apples" to suc-ceed, so the AND program would 
be called asaln. It would fall to find a NP beginning with 
"drank", so the NP "the apples" would be returned, causing the 
VP to succeed. This time, AND would try to parse a VP and would 
find "drank the vodka". It would therefore make up a combined 
VP and cause the entire SENTENCE to be completed with the 
structure; 



SENTENCE 



DET NOUN VERB 

I I 1 

the giraffe ate 




DET 

the apples and drank the vodka 

Ft cure 51 — Conjoined Clauses 



The program tc actually do this would take only 3 or k 
lines In a PROGRAMMAR grammar. In the actual systen. It Is more 
complex as It handles lists (like "A, B, and C") other 
conjunctions (such as "but") and special constructions (such as 
"both A and B"). The conjunction program Is called by all of 
the conjunctions, the words "either", "neither", and "both", and 
the mark "," which appears as a separate word In the Input. 

The function •* Is used to look ahead for a repetition of 
the special word, as In ". . .and . . .and, . .". If one Is found, a 
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unit of the type most recently parsed Is parsed again, trying to 
extend all the way to the repeated conjunction or comma* This 
Is Iterated as long as there ar& repetitions, with special 
checks made for structures like "A, B, and C n or "A and B && 
Bflt C", As each new node Is parsed. Its structure Is saved, and 
when the last Is found, a new node Is created for the compound* 
Its features combine those for the type of conjunction with 
those appropriate for the type of unit (e.g, a compound NO 
connected with "and" Is £lven the feature "plural" (NPU.) The 
list of constituent structures Is put on the tree as a list of 
subnodes of the conjoined structure, which then replaces the 
original unit on the parsing tree. 

Compounds with a preceding word like "both" are parsed 
differently, since the word Is encountered before any unit has 
been parsed. In this case It Is possible to adopt the more 
general philosophy of attempting the longest possible unit 
first. These words have a SPECL definition, so the program Is 
called as the next unit Is about to be parsed. The conjunction 
program looks for the matching conjunction ("and" with "both", 
"or" with "either", and "nor" with "neither") and tries to parse 
the unit extending only to the conjunction. If this succeeds, 
the normal conjunction procedure Is followed. If not, some sub- 
component Is the conjoined one, and nothing happens until the 
parser attempts a sub*unlt, when the process Is repeated. 

A SPECL program can modify the parsing In several ways. 
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For example It can call the function FLUSHME, which simply 
removes the word from the Input sentence (I.e. It Is Ignored). 
It can take arbitrary actions on the current parsing tree, can 
Indicate to PROGRAMMAR that It should SKIP parsing the unit and 
use Instead results provided by the SPECL program/ or It can 
Indicate an action to be taken after the normal parsing Is DONE. 
Finally, a SPEC or SPECL program can abort the entire parsing. 
Indicating a response to the user. For example, the word 
"thank" calls a SPECL program which checks to see If the next 
word Is "you". If so, the parsing Is clven up, and the system 
replies "YOU'RE WELCOME' 1 . Currently there Is no backup 
procedure to modify the Interpretation of an ambiguous structuri 
like M A and B or C". This will In fact be parsed as (A and (B 
or C)>. Notice that "either A and B or C" will be parsed 
correctly as ((A and B) or C). 

The exact format for a SPEC or SPECL definition Is a LISP 
list to which will be appended two Items -- the Initial feature 
list of the unit being parsed and an Indicator of whether It Is 
a word or a unit which called the program. The resultant form 
Is then EVALIed, 
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2-fc-S Possesslves 

One of the best examples of the advantages of procedural 
grammars Is the ability to handle left-branching structures like 
possess! ves. In a normal top-down parser, these present 
difficulties, since any NG can begin with a possessive NG, which 
can In turn begin with a possessive NG, etc, as In "my mother's 
sister's student's cat's fur". Special care must be taken to 
avoid infinite loops. 

In our grammar this Is handled by a check after the NOUN or 
PRONOUN Is found In a NG. If It has the feature "possessive" 
(POSS) (e.g. W or "block's' 1 ) a node Is created for the NG 
thus far parsed, and this Is placed on the tree as a constituent 
(the determiner) of a NG to be continued. The program then 
returns to the point where It was after finding a determiner, 
and continues looking. This can happen any number of times, but 
In each case Is triggered by the presence of another POSSessIve 
word. It therefore loops only as much as necessary. This 
departure from top-down parsing Involves no changes to the 
parser, and only a simple loop in the program. Any other left- 
branching structure can be handled similarly. 



- f if - -< . ■ -,», 



••' 
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UlJL Iilfi Dictionary 

Since PROGRAMMAR Is embedded In LISP, the facilities of 
LISP for handling atom names are used directly. To define a 
word, a list of grammatical features Is put on Its property list 
under the Indicator WORD, and a semantic definition under the 
indicator SMNTC. Two facilities &re Included to avoid having to 
repeat Information for different forms of the same word. First, 
there Is an alternate way of defining words, by using the 
property Indicator W0RD1. This Indicates that the word given Is 
an Inflected form, and Its properties are a modified form of the 
properties of Its root. A W0RD1 definition has three elements, 
the root word, the list of features to be added, and the list of 
features to be removed. For example, we might define the word 
"go" by: (DEFPROP GO (VERB INTRANSITIVE INFINITIVE) WORD) We 
could then define "went" as (DEFPROP WENT (GO 
(PASTHINFINITIVE)) W0RD1) This Indicates that the feature 
INFINITIVE Is to be replaced by the feature PAST, but the rest 
(Including the semantic definition) Is to remain the same as for 

The other facility Is an automatic system which checks for 
simple modifications, such as plurals, n -Ing," forms, "-er" and 
"-est" forms and so forth. If the word as typed In Is not 
defined, the program looks at the way It Is spelled, tries to 
remove Its ending (taking Into account rules such as changing 
"running" to "run", but "buzzing" to "buzz"). It then tries to 
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find a definition for the reduced root word, and If It succeeds. 
It makes the appropriate changes for the ending (such as 
changing the feature SINGULAR to PLURAL). The program which 
does this Is not a part of the PROGRAMMAR system but Is 
specifically built for English. It Is described In section 
2.3,12. 

Everything else described In this section Is designed 
generally for the parsing of any language. In any particular 
language, this Input funtlon would have to be written according 
to the special rules of morphographemlc structure. The 
requirement for such a program Is that Its output must be a 
list, each member of which corresponds to a word In the original 
sentence, and Is In the form described In section 2.U.8. This 
list Is bound to the variable SENT, and Is the way In which 
PROGRAMMAR sees Its Input. 

The other form of data In the dictionary Is tables of verb- 
particle and preposition-preposition combinations like "pick up* 1 
or "on top of". The table Is stored on the property list of the 
Initial word under the Indicator PRTT or PREPP respectively. It 
Is an association list, each member of which has the second word 
of the combination as Its CAR, and a word parsing structure (see 
section 2.1*. 8) as Its CADR. There may be more than one 
combination for the same Initial word (e.g. "pick up", "pick 
out"), and a three-word combination can be defined by making the 
CADR be an association list of the same form for a third word- 
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Uui Backup FatHUlB? 

As explained In section 2,2.7, there Is no automatic 
backup, but there are a number of special functions which can be 
used In writing grammars. The simplest, (POPTO X) simply 
removes nodes from the tree. The argument is a list of 
features, and the effect Is to remove daughters of the currently 
active node, beginning with the rightmost and working leftward 
until one Is reached with all of those features, (POP X) Is the 
same, except that It also removes the node with the Indicated 
features. If no such node exists, nei ther f unct Ion takes any 
action. (POP) Is the same as (POP NIL), and a non-nil value Is 
returned by both functions if any action has been taken. 

A very Important feature Is the CUT variable. One way to 
do backup Is to first try to find the longest possible 
constituent at any point, then If for any reason an Impasse Is 
reached, to return and try again , limiting the consltuent from 
going as far along In the sentence. For example. In the 
sentence "Was the typewriter sitting on the cake? 11 , the parser 
will first find the auxllllary verb "was", then try to parse the 
subject. It will find the noun group "the typewriter sitting on 
the cake", which In another context might well be the subject 
("the typewriter sitting on the cake Is broken."). It then 
tries to find the verb, and discovers none of the sentence Is 
left. To back up. It must change the subject. A very clever 
program would look at the structure of the noun group and would 



Section 2.U. 5 - Pane 187 



realize that the modifying clause "sitting on the cake 11 must be 

dropped, A more simple-minded but still effective approach 

would use the following Instructions: 

(** K PW) 

(POP) 

((CUT PTW)SUBJECT (ERROR)) 

The first command sets the pointer PTW to the last word In 
the constituent On this case, "cake' 1 ). The next removes that 
constituent. The third sets a special pointer, CUT to that 
location, then sends the program back to the point where It was 
looking for a subject. It would now try to find a subject 
again, but would not be allowed to go as far as the word "cake". 
It might now find "the typerwrlter sitting," an analog to "The 
man sitting Is my uncle." If there were a good semantic 
program. It would realize that the verb "sit" cannot be used 
with an inanimate object without a location specified. This 
would prevent the constituent "the typewriter sitting" from ever 
being parsed. Even If this does not happen, the program would 
fall to find a verb when It looked at the remaining sentence, 
"on the cake." By going through the cutting loop again. It would 
find the proper subject, "the typewriter," and would continue 
through the sentence. 

Once a CUT point has been set for any active node, no 
descendant of that node can extend beyond that point until the 
CUT Is moved. Whenever a PROGRAMMAR program Is called, the 
variable END Is set to the current CUT point of the node which 
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called It, The CUT point for each constituent Is Initially set 
to Its END. When the function PARSE Is called for a word. It 
first checks to see If the current CUT has been reached (I.e. N 
and CUT are the same), and If so It falls. The third branch In 
a three-dl rect Ion branch statement Is taken If the current CUT 
point has been reached. The CUT pointer Is set with the 
function CUT of one argument. 
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1JU& Auxll 1 larv Functions 

Since PROGRAMMAR grammars are programs, they can call 
subroutines just as any other program. These subroutines can 
use PROGRAMMAR primitives like PARSE and *, as well as returning 
values for use In the main program. One example used In our 
grammar Is UPCHECK, used to see If the current node Is embedded 
In a structure which could be an UPREL like "the man I wanted 
you to see." It Is used In conjunction with UPMOD which makes 
the appropriate changes to the parsing tree. They both use 
primitives like * to find and change the elements. 

In order to simplify the search for rank-shifted clauses, a 
function PARSEREL was written* It takes as arguments a list of 
clause types (like REPORT, ING, etc.) a corresponding list of 
features to look for an the main verb, a pointer to that verb, 
and the rest of the Information to be Included In the call to 
PARSE. PARSEREL then loops through these lists, attempting to 
parse various types of RSNG clauses If they are In accord with 
the restrictions associated with the verb and the use of the 
clause In the sentence. It uses the function PARSE to modify 
the parsing tree before returning to the main CLAUSE program. 
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UuZ Messages 

To write good parsing pro^r^ms, we may at times want to 

know why a particular PROGRAMMAR program failed, or why a 

certain pointer command could not be carried out. In order to 

facilitate this, two message variables are kept at the top level 

of the system, MES, and MESP. Messages can be put on MES In two 

ways, either by using the special failure directions In the 

branch statements (see section 2.2,5) or by using the functions 

M and MQ, which are exactly like F and FQ, except they put the 

Indicated feature onto the message list HE for that unit. When 

a unit returns either failure or success, MES Is bound to the 

current value of ME, so the calling program can receive an 

arbitrary list of messages for v/hatever purpose It may want 

them. MESP always contains the last failure message received 

from ** or *. 
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2-JU& Ih£ Form of It* Parsing Tree 

Each node Is actually a list structure with the following 
Information: 

the list of features associated with the node 
NB the place In the sentence where the constituent 

begins 
N the place Immediately after the constituent 

H the subtree below that node (actually a list of 

Its daughters In reverse order, so that 

H points to the last constituent parsed) 
SM a space reserved for semantic Information 

These symbols can be used In two ways. If evaluated as 

variables, they will always return the designated Information 

for the currently active node. C Is always a pointer to that 

node. If used as functions of one argument, they give the 

appropriate values for the node pointed to by that argument; so 

(NB H) gives the location In the sentence of the first word of 

the last constituent parsed, whMe (FE(NB H)) would give the 

feature list of that word. 

Each word In the sentence Is actually a list structure 

containing the k I terns i 

FE as above 

SMWORD the semantic definition of the word 

WORD the word Itself (a pointer to an atom) 

ROOT the root of the word (e.g. "run" If the 

word Is "running") . 
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2.h.9 Variables Maintained bv the System 

There are two types of variables, those bound at the top 

level, and those which are rebound every time a PROGRAMMAR 

program is cai led. 

Variables bound at the top level 

N Always points to next word In the 

sentence to be parsed 
SENT Always points to the entire sentence 

PT PTW Tree and sentence pointers. 

See Section 2. k. 10 
MFS MESP List of messages passed up from lower 

levels. See Section 2.1*. 7 

Special variables bound at each level 

C FE MB 5M H See section 2.d.8 

NN CUT END See section 2. U.S. NN always 

equals (N0TCE0 N CUT)) 

UNIT the name of the currently active 

PROGRAMMAR program 

REST the list of arguments for the call 

to PARSE (These forrr the Initial 
feature list for the node, but as 
other features are added, REST 
continues to hold only the original 
ones , ) 

Tl T2 T3 Three temporary PROG variables for use 

by the program In any way needed. 

MVB Bound only when a CLAUSE Is parsed 

used as a pointer to the main verb 

ME List of messages to be passed up to 

next level See Section 2.H.7 



Section 2.U.10 - Page 195 



2-1.10 Pointers 

The system always maintains two pointers/ PT to a place on 
the parsing tree, and PTW to a place In the sentence. These ar* 
moved by the functions * and *• respect I vel y, as explained In 
section 2.2.10. The Instructions for PT are: 



C set PT to the currently active node 

H set PT to most recent {rightmost) daughter of C 

DL (down-last) move PT to the rightmost daughter 

of I ts current value 
DLC (down-last completed) like DL, except It only 

moves to nodes which are not on the push-down 

1 1st of active nodes. 
D p (down-first) like DL, except the leftmost 

PV (previous) move PT to Its left-adjacent sister 

NX (next) move PT to Its right-adjacent sister 

U (up) move PT to parent node of Its current value 

N Move PT to next word In sentence to be parsed 

The pointer PTW always points to a place In the sentence. 

It Is moved by the function *• which has the same syntax as *, 

and the commands: 

N Set PTW to the next word In the sentence 

FW (first-word) set PTW to the first word of the 

constituent pointed to by PT 
LW (last-word) 1 Ike FW 

AW (after-word) like FW, but first word after the 

const I tuent 
NW (next-word) Set PTW to the next word after Its 

current value 
PW (previous-word) like NW 

SFW (sentence-f Irst-word) set PTW to the first word 

In the sentence 
SLW (sentence-last-word) like SFW 

Since the pointers are bound at the top level, a program 

which calls others which move the pointers may want to preserve 

their location, PTW Is a simple variable, and can be saved with 

a SETQ, but PT operates by keeping track of the way It has been 
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moved. In order to be able to retrace Its steps. This Is 
necessary since LISP lists are threaded In only one direction 
{in this case, from the parent node to Its daughters, and from a 
right sister to Its left sister). The return path Is bound to 
the variable PTR, and the command (PTSV X) saves the values of 
both PT and PTR under the variable X, while (PTRS X) restores 
both values. 



Section 2.1**11 - Pag* 195 



ZJlAI Fq?turq Manipulating 

As explained In section 2.2.6, we must be able to attach 
features to nodes In the tree. The functions F, FQ, and TRNSF 
are used for putting features onto the current node, while R and 
RQ remove them. (F A) sets Che feature list FE to the union of 
Its current value with the list of features A. (FQ A) adds the 
single feature A (I.e. It quotes Its argument), (TRNSF A B) was 
explained In Section 2.2.7, R and RQ are Inverses of F and FQ. 
The functions ISX, ISQ, CQ, and NQ are used to examine features. 
If A points to a node of the tree or word of the sentence, and B 
points to a feature/ (ISX A B) returns non-nil If that node has 

that feature. (ISQ A B) Is equivalent to (IS A (QUOTE B)), (CQ 
B) Is the same as (ISQ C B) (where C always points to the 
currently active node), and (NQ B) Is the same as (ISQ N B) (N 
always points to the next word In the sentence left to be 
parsed) . 

The function NEXTH checks to see of the root of the next 
word matches the argument. (NEXTW BE) evaluates to non-Nil only 
If the next word is some form of the verb "be". PUTF and REMF 
are used to add and remove features from some node other than 
the current one. They &re FEXPRS whose argument Is a list of 
features, which are put on or removed from the node currently 
pointed to by the pointer PT, 
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2.5 Comparison with Other Parsers 

?-S.i Q'der parsers 

When work first began on analyzing natural language with 
computers, no theories of syntax existed which were explicit 
enough to be used. The early machine-translator designers were 
forced to develop thplr own linguistics as they worked, and they 
produced rough and ready versions. The parsers were collections 
of "packaging routines 1 ', "InsertRd structure passes 1 ', "labeling 
subroutines", etc. {see <GarvIn>) which evolved gradually as 
the grammars were expanded to handle more and more complex 
sentences. They had the same difficulties as any program 
designed In this way — as they became more complex It became 
harder and harder to understand the Interactions within them. 
Making extensions which were Intended to deal with a limited 
anticipated set of Inputs tended to make It difficult to extend 
the system later. 

When the machine-translation effort failed. It seemed clear 
that It had been premature to try handling all of English 
without a better background of linguistic theory and an 
understanding of the mathematical properties of grammars. 
Computer programs for natural language took two separate paths. 
The first was to Ignore traditional syntax entirely, and to use 
some sort of more general pattern matching process to get 
Information out of sentences. Systems such as STUDENT, SIR, 
ELIZA, and Semantic Memory made no attempt to do a complete 
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syntactic analysis of the Inputs. They either limited the user 

to a small set of fixed Input forms or limited their 

understanding to those things they could get while Ignoring 

syntax. 

The other approach was to take a simplified subset of 

English which could be handled by a we) 1 -understood form of 

grammar, such as one of the variations of context-free grammars. 

There has been much Interesting research on the properties of 

abstract languages and the algorithms needed to parse them. 

Using this theory, a series of parsing algorithms and 

representations were developed. For a summary of the computer 

parsers designed before 1966, see <Bobrow 196U>. A more recent 

development was Early's context-free parser <Early> which 

operates In a time proportional to the cube of the length of a 

sentence. 

The problem faced by all of these parsers (Including the 
mammoth Harvard Syntactic Analyzer (<Kuno>). Is that such 
simple models are not adequate for handling the full complexity 
of natural language- This Is discussed theoretically In 
<Chomsky 1957> but for our purposes It Is more important to note 
that many aspects which could theoretically be handled would be 
Included only at the expense of gross Inefficiency and 
unnecessary complexity. 

Several people attempted to use Chomsky's transformational 
grammar as the basis for parsers, (see <PetrIck> and <ZwIcky>) 
They tried to "unwind* 1 the transformations to reproduce the deep 
structure of a sentence, which could then be parsed by a context 
free "base component". It soon became apparent that this was a 
very difficult task. Although transformational grammar Is 
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theoretically a "neutral" description of language. It Is In 
fact highly biased toward the process of generating sentences 
rather than interpreting them. Adapting generation rules to use 
In Interpretation Is relatively easy for a context-free &rammar, 
but extremely difficult for transformational grammars. <Woods 
1969> discusses the problems of "combinatorial explosion" 
Inherent In the Inverse t ransfornat lonal process. The 
transformational parsers have not gone beyond the sta^e of 
handling small subsets of English In an Inefficient way. 
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? ■ 5 ■ 2 A^ftmsnted Transl t Iqp NetworKs 

In the past two years, three related parsing systems have 
been developed to deal with the full complexity of natural 
language. The first was by Thorne, Bratley, and Dewar UThorne 
1968 and 1969>), and the more recent ones are by Bob row and 
Fraser (<Bobrow 19G9>) and Woods (<Woods 1969>). The three 
programs operate In v&ry similar ways, and since Woods* is the 
most advanced and best documented, we will use It for 
comparison. In his paper Woods compares his system with the 
other two. 

The basic Idea of these parsers Is the "augmented 
trans 1 1 ion network 11 . The parser Is seen as a transition network 
much II ke a finite-state recognizer used for regular 1 anguages 
In automata theory. 

The first extension Is In allowing the networks to make 
recursive calls to other networks (or to themselves). The 
condition for following a particular state transition Ik not 
limited to examining a single Input symbol. The condition on 
the arc can be something like "NP" where NP Is the name of an 
Initial state of another network. This recursively called NP 
network then examines the Input and operates as a recognizer. 
If It ever reaches an accepting state. It stops, and pars I ng 
continues from the end of the UP arc In the original network. 
These "recursive transition networks" have the power of a 
context-free grammar, and the correspondence between a network 
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and Its equivalent grammar Is quite simple and direct. 

To parse the full range of natural language, we need a 
critical addition. Instead of using "recursive transition 
networks" these parsers use "augmented transition networks", 
which can "make changes In the contents of a set of registers 
associated with the network, and whose transitions can be 
condi t lonal on the contents of those re£l sters, (< Woods 1969>) . 
This Is done by "adding to each arc of the transition network an 
arbitrary condition which must be satisfied In order for the arc 
to be followed, snd a set of structure building actions to be 
executed If the arc Is followed." 

Augmented transition networks have the power of Turing 
machines (since they have changeable registers and can transfer 
control depending on the state of those registers). Clearly 
they can handle any type of granmar which could possibly be 
parsed by any machine. The advantages lie In the ways In which 
these augmented networks are close to the actual operations of 
language, and give a natural and understandable representation 
for grammars. 
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3.5.5 Networks and PrgRrams 

How does this type of parser compare with PROGRAMMAR? Is 

there anything In convnon between grammars which are networks and 

grammars which are programs? The reader may have already seen 

the "joke 11 In this question. In fact these are just two 

different ways of talking ahout doing exactly the same thlngl 

Picture a flowchart for a PROGRAMMAR grammar. In which 
calls to the function PARSE are drawn on the arcs rather than at 
Che nodes. Every arc then Is either a reouest to accept the 
next word In the Input (when the argument of PARSE Is a word 
class), or a recursive call to one of the granmar programs. At 
each node (Ke. segment of program between conditionals and 
PARSE calls) we have "a set of arbitrary structure building 
actions*" Our flowchart Is, just like an augmented transition 

network. 

Now picture how Woods' networks are fed to the computer. 
He uses a notation (see <Woods 1969> p. 17) which looks very 
much like a L I SP-embedded computer language, such as PROGRAMMAR 
or PLANNER, In fact, the networks could be translated almost 
directly Into PLANNER programs (PLANNER rather than LISP or 
PROGRAMMAR because of the automatic backup features « see 
discussion below). 

It Is an Interesting lesson In computer science to look at 

Woods 1 discussion of the advantages of networks, and "translate 11 

them Into the advantages of programs. For example, he talks 

about efficiency of representation. M A major advantage of the 

transition network model Is. ..the ability to merge the common 

parts of many context free rules." 

Looking at grammars as programs, vie can call this "sharing 

subroutines". Me says "The augmented transition network, 

through Its use of flags allows for the merging of similar parts 

of the network by recording Information In registers and 

Interrogac Ing It. ..and to merge states whose transitions are 
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similar except for conditions on the contents of the registers," 
This Is the use of subroutines with parameters. In addition, 
the networks can "capture the regularities of the 
language. . .whenever there are two essentially Identical parts of 
the grammar which differ only In that the finite control part of 
the machine Is remembering some piece of Informat Ion. . . I t Is 
sufficient to explicitly store the distinguishing piece of 
Information In a register and use only a single copy of the 
subgraph." This Is clearly the use of subroutines with an 
argument I 

Similarly we can go through the arguments about efficiency, 
the ease of mixing semantics with syntax, the ability to Include 
operations which are "natural" to the task of natural language 
analysis, etc. All of them apply Identically whether we are 
looking at "transition networks 11 or "programs' 1 . 

What about "perspicuity"? Woods claims that augmented 
transition networks retain the perspicuity (ease of reading and 
understanding by humans) of simpler grammar forms. He says that 
transformational grammars have the problem that "the effect of a 
given rule is Intimately bound up with Its Interrelation to 
other rules., .It may roqulre an extremely complex analysis to 
determine the effect and purpose*" (<Woods 1<)69> p. 38) This Is 
true, but It would also be true for any grammar complex enough 
to handle all of natural language. The simple examples of 
transition networks are Indeed easy to read (as are simple 
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examples of most grammars), hut In a network for a complete 

language, the purpose of a given state would be Intimately bound 

up with Its Interrelation to other states, and the grammar will 

not be as "perspicuous" as we might hope. This Is Just as true 

for programs, but no more so. If we look at the flow chart 

Instead of the listing, programs are equally perspicuous to 

networks. 

If the basic principles are really the same, are there any 

differences at all between Woods' system and ours? The answer 

Is yes, they differ not In the theoretical power of the parser, 

but In the types of analysis belnj* carried out. 

The most Important difference Is the theory of grarwiar 
being used. All of the network systems are based on 
transformational grammar. They try to reproduce the "deep 
structure" of a sentence while doing surface structure 
recognition. This Is done by using special commands to 
explicitly build and rearrange the deep structures as the 
parsing goes along. PROGRAMMAR is oriented towards systemic 
grammar, with Its Identification of significant features In the 
constituents being parsed. It therefore emphasizes the ability 
to examine the features of constituents anywhere on the parsing 
tree, and to manipulate the feature descriptions of nodes* 

In section 2.1 we discussed the advantages of systemic grammar 

for a language understanding system. Either type of parser 

could be adapted to any type of grammar, but PROGRAMMAR was 

specially designed to Include "natural" operations for systemic 

understanding of sentences. 

A second difference Is in the Implementation of special 

additions to the basic parser. For example In section 2.4.2 we 

discussed the way In which words like "and" could be defined to 
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act as "demons" which Interrupt the parsing at whatever point 
they are encountered, and start a special program for 
Interpreting conjoined structures. This has many uses, both In 
the standard parts of the grammar (such as "and") and In 
handling Idioms and unusual structures. If we think In network 
terms, this Is like having a separate arc marked "and" leading 
from every node In the network. Such a feature could probably 
be added to the network formulation, but It seems much more 
natural to think In terms of programs and Interrupts. 

A third difference Is the backup mechanism. The network 
approach assumes some form of nondetermlnlsm. If there are 
several arcs leaving a node, there must be some way to try 
following all of them. Either we have to carry forward 
simultaneous interpretations, or keep track of our choices in 
such a way that the network can automatically revise Its choice 
If the original choice does not lead to an accepting state. 
This could be done In the program approach by using a language 
such as PLANNER with Its autonatlc backup mechanisms. But In 
section 2.2.7 we discussed the question of whether It Is even 
desirable to do so In handling natural language. 

We pointed out the advantage of an Intelligent parser which 
can understand the reasons for Its failure at a certain point, 
and can guide Itself accordingly Instead of backing up blindly. 
This Is Important for efficiency, and Woods Is very concerned 
with ways to modify the networks to avoid unnecessary and 
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wasteful backup by "making the network more determlnls tic." 
(<Woods 1969> p. h5). It might be Interesting to explore a 
compromise solution In which automatic backup facilities 
existed, but could be turned on and off. We could do this by 
giving PROGRAWMAR special commands which would cause It to 
remember the state of the parsing so that later the grammar 
could ask to back up to that state and try something else. This 
Is an Interesting area for further work on PROGRAMMAR. 

It Is difficult to compare the performance of different 
parsers since there Is no standard grammar or set of test 
sentences. Bobrow and Woods have not published the results of 
any experiments with a large grammar, but Thorne has published 
two papers UThorne 1968, 1969>) with a number of sample 
parsings. Our system, with Its current grammar of English has 
successfully parsed all of these examples. They took from 1 to 
5 seconds apiece. Some samples of more complicated parsings 
done by the system are given In Appendix B. 



* 
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CHAPTER 3. Inference 



3.1 Basic Approach to Meaning 
3,1,1 Representing Knowledge 

We have described the process of understanding language as 
a conversion from a string of of sounds or letters to an 
Internal representation of "meaning". In order to do this, a 
language-understanding system must have some formal way to 
express Its knowledge of a subject, and rust he able to 
represent the "meaning" of a sentence In this formalism. The 
formalism must be structured In such a way that the system can 
use Its knowledge to make deductions, accept new Information, 
answer questions, and interpret commands. Choosing a form for 
this Information Is of central Importance to both a practical 
system and a theory of semantics. 

First we must decide what kinds of things are to be 
represented In the formalism. As a beginning, we would like to 
be able to represent "objects", "properties," and "relations," 
Later we will have to show how these can be combined to express 
more complicated knowledge. We will describe ways to express 
the meaning of a wide variety of complex sentences. 

Using a simple prefix notation, we can represent such facts 
as "Boise Is a city. 1 ' and "Noah was the father of Jafeth," as: 
(CITY BOISE) (FATHER-OF NOAH JAFETH) 
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Here/ BOISE, NOAH, and JAFETH are specific objects, CITY Is 
a property which objects can have, and FATHER-OF Is a relation. 
It Is a practical convenience to list properties and relations 
first, even though this may not follow the natural English 
order, so we will do so throughout. Notice that properties are 
In fact special types of relations which deal with only one 
object. Properties and relations will be dealt with In 
Identical ways throughout the system. In fact. It Is not at all 
obvious which concepts should be considered properties and which 
relations. For example, "DeGaul le Is old." might be expressed 
as (OLD DEGAULLE) where OLD Is a property of objects or as (AGE 
DEGAULLE OLD), where AGE Is a relation between an object and Its 
age. In the second expression, OLD appears In the position of 
an object, even though It can hardly be construed as a 
particular object like BOISE or DEGAULLE. This suggests that we 
might like to let properties or relations themselves have 
properties and enter Into other relations. This has a deep 
logical consequence which will be discussed In later sections. 

In order to avoid confusion, we will need some convpntlons 
about notation. Host objects and relationships do x}.QX have 
simple Enellsh names, and those that do often share their names 
with a range of other meanings. The house on the corner by the 
market doesn't have a proper name like Jafeth, even though It Is 
just as much a unique object. For the Internal use of the 
system, we will give It a unique name by stringing together a 
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descriptive word and an arbitrary number, then prefixing the 
result with a colon to remind us It Is an object. The house 
mentioned above might be called :HOUSE37U, Properties and 
relations must also go under an assumed name, since (FLAT X) 
might mean very different things depending on whether X Is a 
tire or a musical note. We can do the same thing (using a 
different punctuatlor mark, O to represent these two meanings 
as *FLAT1 and *FLAT2. When the meaning Intended Is clear, we 
will omit the numbers, but leave the punctuation marks to remind 
us that It Is a property or relation rather than a specific 
object. Thus, our facts listed above should be written: 
(#CITY :B0ISE) (#FATHEP-0F :K0AH tJAFFTH), and either 

UOLD :DEGAULLE) or (#AGE :DEGAULLE #0LD). 

We are letting properties serve In a dual function — we 
can use them to say things about objects (as In "The sky Is 
blue. 11 — (#BLUE :SKY>) or we can say things about them as If 
they were objects (as In "Blue Is a color. " — UCOLOR *BLUE>). 
We want to extend this even further, and allow entire 
relationships to enter Into other relationships, (Vie 
distinguish between "relation 11 , the abstract symbol such as 
#FATHER-0F, and "relationship", a particular Instance such as 
UFATHER-OF : HDAH :JAFETH>). In accord with our earlier 
convention about naming things, we can give the relationship a 
name, so that we can treat It like an object and say (*KMOW ;l 
:REL76) where :REL76 Is a name for a particular relationship 
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like CFATHEU-OF :N0AH :JAFETH>. We can keep straight which 
name goes with which relatlonshEp hy putting the name directly 
Into the relationship. Our example would then become (*FATHER- 
OF :N0AH :JAFETH :REL76). There Is no special reason to put 
the name last, except that It makes Indexing and reading the 
statements easier. We can tell that :REL76 Is the name of this 
relation, and not a participant since FATHER-OF relates only two 
objects. Similarly, we knew that It has to be a participant In 
the relationship (-KNOW :l :REL76) since #KNDK needs two 
arguments. 

We now have a system which can be used to describe more 
complicated facts. "Harry slept on the porch after he gave 
Alice the jewels." would become a set of assertions: 
USLEEP : HARRY :REL1> ('LOCATION :REL1 :PORCH) 

('GIVE :HARRY :ALICE :JEV.'ELS :REL2) UAFTER :REU :REL2) 

This example points out several facts abut the notation. 
The number of participants In a relationship depends on the 
particular relation, and can vary from to any number. We do 
not need to give every relationship a name -- It Is present only 
If we want to be able to refer to that relationship elsewhere. 
This will often be done for events, which are a type of 
relationship with special properties (such as time and place of 
occurrence) . 



Section 3,1-2 - Page 210 



_£ Philosophical Considerations 
Before going on, let us stop and ask what we are doing. In 
the preceding paragraphs, we have developed a notation for 
representing certain kinds of meaning. In doing so we have 
glibly passed over Issues which have troubled phllsophers and 
linguists for thousands of years. Countless treatises and 
debates have tried to analyze just what It means to be an 
"object" or a "property 11 , and what logical status a symbol such 
as #BLUE or #CITY should have. We will not attempt to give a 
philosophical answer to these questions, but Instead take a more 
pragmatic approach to meaning. 

Language Is a process of communication between people, and 
is Inextricably enmeshed In the knowledge that those people have 
about the world. That knowledge Is not a neat collection of 
definitions and axioms, complete, concise and consistent. 
Rather It is a collection of concepts designed to manipulate 
Ideas. It Is In fact Incomplete, highly redundant, and often 
Inconsistent. There Is no self-contained set of "primitives" 
from which everything else can be defined. Definitions are 
circular, with the meaning of each concept depending on the 
other concepts . 

This might seem like a meaningless change — saying that the 
meaning of words Is represented by the equally mysterious 
meanings of "concepts" which exist In the speaker's and hearer's 
minds, but which are open to neither Immediate Introspection nor 
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experiment. However, there Is a major difference. The structure 

of concepts which is postulated can be manipulated by a logical 

system within the computer. The "Internal representation" of a 

sentence Is something which the system can obey, answer, or add 

to Its knowledge. It can relate a sentence to other concepts, 

draw conclusions from It, or store it In a way which makes It 

useable in further deductions and analysis. 

This can be compared to the use of "forces' 1 in physics. We 
have no way of directly observing a force like gravity, but by 
postulating Its existence, we can write equations describing It, 
and relate these equations to the physical events Involved. 
Similarly, the "concept" representation of meaning Is not 
intended as a direct picture of something which exists In a 
person's mind. It Is a fiction of the scientist, valid only In 
that It gives him a way to make sense of data, and predict 
actual behavior. 

The justification for our use of concepts In this system 

Is the way It actually carries out a dialog which simulates In 

many ways the behavior of a human language user. For a wider 

field of discourse. It would have to bo expanded In Its details, 

and perhaps In some aspects of Its overall structure. However 

the Idea Is the same -- that we can in fact gain a better 

understanding of language use by postulating these fictitious 

concepts and structures, and analyzing the ways In which they 

interact with language. 

The success of such a theory at actualy describing language 

will depend largely on the power and flexibility of the 

representation used for the concepts. Later sections of this 

chapter discuss the reasons why PLANNER Is particularly well 
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sul ted for this job. 

We would like to consider some concepts as "atomic". <l.e. 
concepts which are considered to have their own meaning rather 
than being just combinations of other more baste concepts). A 
property or relation Is atonic not because of some special 
logical status, but because It serves a useful purpose In 
relation to the other concepts In the speaker's model o* the 
world. For example, the concept *OLD Is surely not primitive, 
since It can be defined In terms of *AGE and nunber. However, 
as an atomic property It will often appear In knowledge about 
people, the way they look, the way they act, etc. Indeed, we 
could omit It and always express something like "having an age 
greater than 30", but our model of the world will be simpler and 
more useful If we have the concept *0L0 available Instead. 

There Is no sharp line dividing atomic concepts from non- 
atonic ones. It would be absurd to have separate atomic 
concepts for such things as ICITY-0F-POPULATION-23,U65 or 
#PER50N-WEIGHINR-BETWEEN-178-AHD-181. Rut It might In fact be 

useful to distinguish between ^BIG-CITY, #TOWN, and ^VILLAGE, or 
between IFAT, and #THIN, since our model may often use these 
distinctions. 

If our "atomic" concepts are not logically primitive, what 
kind of status do they have? What Is their "meaning"? How are 
they defined? The answer Is again relative to the world-model 
of the speaker. Facts cannot be classified as "those which 
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define a concept" and "those; whTch describe It," Ask someone to 
define #PERS0M or 'JUSTICE, and he will come up with a formula 
or slogan which Is very limited. ^JUSTICE Is defined In his 
world-model by a series of examples, experiences, and specific 
cases. The model Is circular, with the meaning of any concept 
depending on the entire knowledge of the speaker, (not just the 
kind which would be Included In a dictionary). There must be a 
close similarity between the models held by the speaker and 
listener, or there could be no communication. If my concept of 
^DEMOCRACY and yours do not coincide, we may have great 
difficulty understanding each other's political viewpoints. 
Fortunately, on simpler things such as *BLUE, #D0G, and RAFTER, 
there Is a pretty Rood chance that the models will be 
practically Identical. In fact, for simple concepts, we can 
choose a few primary facts about the concept and use them as a 
"definition" , which corresponds to the traditional dictionary. 
Returning to our notation, we see that It Is Intentionally 
general, so that out system can deal with concepts as people do. 
In English we can treat events and relationships as objects, as 
In "The war destroyed Johnson's rapport with the people." Within 
our representation of meaning we can similarly treat an event 
such as »WAR or a relationship of *RAPPORT In the same way we 
treat objects. We do not draw a sharp philosophical distinction 
between "specific objects", "properties", relationships", 
"events 11 , etc. 
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3.1.3 complex | nf grmaUon 

We now have a way to store a data base of assertions about 
particular objects, properties, and relationships. Next, we 
want to handle more complex Information, such as "All canaries 
are yellow.", or "A thesis Is acceptable If either It Is long or 
It contains a persuasive argument." This could be done using 

a formal language such as the predicate calculus. Basic logical 
relations such as Implies, or, and, there-exists, etc. are 
represented symbolically, and Information Is translated Into a 
"formula". Thus we might have: 



{FORALL (X) (IMPLIESUCANARY X)(*C0L0R X #YELL0W))> 

(FORALL (X)dMPLIES 

(AND HTHESIS X) 

(OR (HONG X) 

(EXISTS (Y) 

(AND ('PERSUASIVE Y) 
(•ARGUMENT Y) 
{♦CONTAINS X Y))))) 
(•ACCEPTABLE X)>) 

Figure 52 -- Predicate Calculus Representation 



Several notatlonal conventions are used. First, we need 
variables so that we can soy things about objects without naming 
particular ones. This Is done with the quantifiers FORALL and 
EXISTS. Second, we need logical relations like AND, OR, NOT, 
and IMPLIES. Using this formalism, we can represent a question 
as a formula to be "proved". To ask "Is Sam's thesis 
acceptable?" we could give the formula ('ACCEPTABLE :SAM-THESIS) 
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to a theorem prover to prove by manipulating the formulas end 

assertions In the data base according to the rules of logic. We 

would need some additional theorems which would allow the 

theorem prover to prove that a thesis Is long, that an argument 

Is acceptable, etc. 

In some theoretical sense, predicate calculus formulas 
could express all of our knowledge, but In a practical sense 
there Is something missing. A person would also have knowledge 
about how to go about doing the deduction. He would know that 
he should check the length of the thesis first, since he might 
be able to save himself the bother of reading It, and that he 
might even be able to avoid counting the pages If there Is a 
table of contents. In addition to complex Information about 
what must be deduced, he also knows a lot of hints and 
"heuristics" telling how to do It better for the particular 
subject being discussed. 

Most "theorem-proving 11 systems do not have any way to 

Include this additional Intelligence. Instead, they are limited 

to a kind of "working In the dark". A uniform proof procedure 

gropes Its way through the collection of theorems and 

assertions, according to some general procedure which does not 

depend on the subject matter. It tries to combine any facts 

which might be relevant, working from the bottom up. In our 

example given above, we might have a very complex theorem for 

deciding whether an argument Is persuasive. A uniform proof 

procedure might spend a great deal of time checking the 

persuasiveness of Bwery argument It knew about, since a clause 

of the form {PERSUASIVE X) might be relevant to the proof. What 

we would prefer Is a way for a theorem to guide the process of 

deduction In an Intelligent way. Carl Hewitt has worked with 
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this problem and has developed a theorem-proving language called 
PLANNER <Hewltt 1968, 1969>, In PLANNER, theorems are In the 
form of programs, which describe how to go about proving a goal, 
or how to deduce consequences from an assertion. This Is 
described at length In section 3.3, and forms the basis for the 
Inference part of our English unders tander. In PLANNER, our 
sentence about thesis evaluation could be represented as shown 
In Figure 53. 

This Is similar in structure to the predicate calculus 
representation given above, but there are Important differences. 
The theorem Is a program, where each logical operator i ndlcates 
a definite series of steps to be carried out. THGOAL says to 
try to find an assertion In the data base, or to prove It using 
other theorems. THUSE gives advice on what other theorems to 
use, and In what order. THAND and THOR are equivalent to the 
logical AND and OR except that they give a specific order In , 
which things should be tried. (The "lisping" Is to 
differentiate PLANNER names from the standard LISP functions AND 
and OR. This same convention Is used for all functions which 
have LISP analogs.) 

The theorem EVALUATE says chat If we ever want to prove 
that a thesis Is acceptable, we should first make sure It Is a 
thesis by looking In the data base. Next, we should try to 
prove that It Is long, first by using the theorem CONTENTS -CHECK 
Cwhlch would check the table of contents), and If that falls, by 
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(DEFINE THEOREM EVALUATE 

; EVALUATE Is Che name we are 
;glving to the theorem 

(THCONSECX Y) 

;thls Indicates the type of 
j theorem and names Its 
; variables 

(THGOAL(#THESIS J?X)) 

;show that X Is a thesis 

;the "$? M Indicates a variable 

(THOR 

;THOR Is like "or", trying things 
;ln the order given until one works 

(THGOALULONG S?X)(THUSE CONTENTS-CHECK C0UNTPAGE5)) 

;THUSE says to try the theorem 
;named CONTENTS-CHECK first, 
;then If that doesn't work, try 
; the one named COUNTPAGES 

(THANO 

;THAND Is 1 Ike "and" 

<THGOAL(#CONTAINS S?X i?Y)) 

;flnd something Y which Is 
;conta!ned In X 

CTHGOAL(*ARGUMENT $?Y)> 

;show that It Is an argument 

<THGOAL(#PERSUASIVE S7Y)(THTBF THTRUE)))))) 

; prove that It Is persuasive, us Ing 
;any theorems which are applicable 



Figure 53 -- PLANNER Representation 
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now 



using a theorem named COUHTPAGES (which might In fact call a 
simple LISP program which thumbs through the paper.) If they 
both fall, then we look In the data base for something contained 
In the thesis, check that It is an argument, and then finally 
try to prove that It Is persuasive. Here, we have used (THTBF 
THTRUE), which Is PLANNER'S way of saying "try anything you k 
which can help prove It". PLANNER must then go searching 

through all of Its theorems on persuasl vensess, just as any 
other theorem prover would. There are two Important changes, 
though. First, we never need to look at persuasiveness at all 
tf we are able to determine that the thesis Is long. Second, 
only look at the persuasiveness of arguments which we already 
know are a part of the thesis. We do not get sidetracked Into 
looking at the persuasiveness theorems except for the cases we 

real 1 y want. 

PLANNER also does a number of other things, like 
maintaining a dynamic data base (assertions can be added or 
removed to reflect the way the world changes In the course of 
time), allowing us to control how much deduction will be done 
when new facts are added to the data base, etc. These are all 
discussed In section 3.3. 
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1*1mM Questions. Statements, and Commands 

PLANNER Is particularly convenient for a language- 
understanding system, since It can express statements, contnands, 
and questions directly. We have already shown how assertions 
can be stated In simple PLANNER format. Commands and questions 
are also easily expressed* Since a theorem is written In the 
form of a procedure, we can let steps of that procedure actually 
be actions to be taken by a robot. The command "Pick up the 
block and put It In the box." could be expressed as a PLANNER 
program: 

(THAND(THGOAL( 'PICKUP :BL0CK23)) 

{THGOALCPUTIN :BLOCK23 :B0X7))> 

Remember that the prefix ": " and the number Indicate a spec! f lc 

object. The theorems for 'PICKUP and 'PUTIN would also be 

programs, describing the sequence of steps to be done. 

Earlier we asked about Sam's thesis In predicate calculus 

In PLANNER we can ask: 

{THGOAL {'ACCEPTABLE tSAH-THES I S) (THUSE EVALUATE)) 

Here we have specified that our theorem EVALUATE Is to be 

used. If we evaluated this PLANNER statement, the theorem would 

be called, and executed just as described on the previous pages. 

PLANNER would return one of the values "T" or "NIL" depending on 

whether the statement Is true or false. 

For a question like "What nations have never fought a war? 1 ' 

PLANNER has the function THFIND. We would ask; 
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(THFIND ALL $?X (X Y) 

(THGOALCNATION $?X)) 
(THNOT 
(THAND(THGOAL(#WAR $?Y)) 

(THGOAL('PARTICIPATED $?X $?Y))))) 

and PLANNER would return a list of all such countries. Using 

our conventions for giving names to relations and events, we 

could even ask: 

(THFIND ALL $?X <X Y Z EVENT) 
(THGOAL(#CHICKEN $?Y)) 
(THGOAL(#ROAD $?Z)> 
(THG0AL(#CR0SS $?Y $?Z J7EVENTJ) 
(THGOALCCAUSE $?X I? EVENT))) 

This brief description has explained the basic concepts 

underlying the deductive part of our language understanding 

program. To go with It, we need a complex model of the subject 

being discussed. This Is described In section l.k. Section 3.3 

gives more details about the PLANNER language and Its uses. 
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3,2 Comparison with Previous Programs 
In Section 3.1 we discussed ways of representing 
Information and meaning within a language-comprehendlng system. 
In order to compare our Ideas with those In previous systems, we 
will establish a broad classification of the field. Of course/ 
no set of pigeon-holes can completely characterize the 
differences between programs, but they can give us some 
viewpoints from which to analyze different people's work, and 
can help us see past the superficial differences. We will deal 
only with the ways that programs represent their Information 
about the subject matter they discuss. Issues such as parsing 
and semantic analysis techniques are discussed In other 
sections. We will distinguish four basic types of systems 
called "special format", "text based", "restricted logic", and 
"general deductive". 
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3-2,1 Special Format System 

Most of the early language undestandlng programs were of 
the special format type. Such systems usually use two special 
formats designed for their particular subject matter — one for 
representing the knowledge they keep stored away, and the other 
for the meaning of the English Input. Some examples are: 
BASEBALL <P.F.Green>, which stored tables of baseball results 
and Interpreted questions as "specification lists" requesting 
data from those tables; SAO SAM <Llndsay>, which Interpreted 
sentences as simple relationship facts about people, and stored 
these In a network structure; STUDENT <Bobrow 196U>, which 
Interpreted sentences as linear equations and could store other 
linear equations and manipulate them to solve algebra problems; 
and ELIZA <Wel2enbaum 19G6>, whose Internal knowledee Is a set 
of sentence rearrangements and key words, and which sees Input 
as a simple string of words. 

These programs all make the assumption that the only 
relevant Information In a sentence Is that which fits their 
particular format. Although they may have very sophisticated 
mechanisms for using this Information (as In CARPS <Charniak>, 
which can solve word problems In calculus), they are each built 
for a special purpose, and do not handle information with the 
flexibility which would allow them to be adapted to other uses, 
Nevertheless, their restricted domain often allows them to use 
very clever tricks, which achieve Impressive results with a 
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minimum of concern for the complexities of language. 
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3.2.2 Text Based Systems 

Some researchers were not satisfied with the limitations 
Inherent In the special-format approach. They wanted systems 
which were not limited by their construction to a particular 
specialized field. Instead they used English text, with all of 
Its generality and diversity, as a basis for storing 
Information. In these "text based" systems, a body of text Is 
stored directly, under some sort of Indexing scheme. An English 
sentence Input to the understander Is Interpreted as a request 
to retrieve a relevant sentence or group of sentences from the 
text. Various Ingenious methods were used to find possibly 
relevant sentences and decide which were most likely to sastlsfy 

the request. 

PROTQSYHTHEX 1 <Slmmons 1966> had an Index specifying all 
the places where each "content word" was found In the text. It 
tried to find the sentences which had the most words In common 
with the request (using a special weighting formula), then did 
some syntactic analysis to see whether the words In common were 
In the right grammatical relationship to each other. Semantic 
Memory <Qullllan 1966> stored a sllghtty processed version of 
English dictionary definitions In which multiple-meaning words 
were eliminated by having humans Indicate the correct 
Interpretation. It then used an associative Indexing scheme 
which enabled the system to follow a chain of Index references. 
An Input request was In the form of two words Instead of a 
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sentence. The response was the shortest chain which connected 
them through the associative Index (e.g. If there Is a 
definition containing the words A and B and one containing B and 
C, a request to relate A and C will return both sentences). 

Even with complex Indexing schemes, the text based approach 
has a basic problem. It can only spout back specific sentences 
which have been stored away, and can not answer any question 
which demands that something be deduced from more than one piece 
of Information. In addition. Its responses often depend on the 
exact way the text and questions are stated In English, rather 
than dealing with the underlying meaning. 
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3-2.3 Limited LPfllC 5VSte"19 

The "limited logic" approach attempted to correct these 
faults of text based systems, and has been used for most of the 
more recent language understanding programs. First, some sort 
of more formal notation Is substituted for the actual English 
sentences In the base of stored knoledge. This notation may 
take many different forms, such as "desrlptlon lists" <Raphael 
196H>, "kernels" <Slmmons 1968> "concept-relatlon-concept 
triples" <Simmons 1969>, "data nodes" <QuIlllan 1969> , "rings" 
<Thompson>, "relational operators" <Tharp> , etc. Each of these 
forms Is designed for efficient use In a particular system, but 
at heart they are all doing the same thing — providing a 
notation for simple assertions of the sort described In section 
3.1.1. It Is relatively unimportant which special form Is 
chosen. All of the different methods can provide a uniform 
formalism which frees simple Information from being tied down to 
a specific way of expressing it in English. Once this Is done, 
a system must have a way of translating from the English Input 
sentences Into this Internal assertion format, and the greatest 
bulk of the effort in language understanding systems has been 
this "semantic analysis". We will discuss It at length In 
chapter U. For now we are more Interested In what can be done 
with the assertions once they have been put Into the desired 

form. 

Some systems (see <Qullllan 1969>, <Tharp>) remain close to 
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text based systems, only partially breaking down the Initial 
text Input. The text Is processed by some sort of dependency 
analysis and left In a network form, either emphasizing semantic 
relationships or remaining closer to the syntactic dependency 
analysis, what Is common to these systems Is that they do not 
attempt to answer questions from the stored Information. As 
with text based systems, they try to answer by giving back bits 
of Information directly from the data base. They may have 
clever ways to decide what parts of the data are relevant to a 
request, but they do not try to break the question down and 
answer It by logical Inference. Because of this, they suffer 
the same deficiencies as text based systems. They have a mass 
of Information stored away, but little way to use It except to 
print It back out. 

Most of the systems which have been developed recently fit 
more comfortably under the classification "limited logic". In 
addition to their data base of assertions (whatever they are 
called), they have some mechanism for accepting more complex 
Information, and using It to deduce the answers to more complex 
questions. By "complex Information" we mean the type of 
knowledge described In section 3.1.3. This Includes knowledge 
containing logical quantifiers and relationships <such as "Every 
canary Is either yellow or purple," or "If A Is a part of B and 
B Is a part of C, then A Is a part of C. ,f K By "complex 
questions", we mean questions which are not answerable by giving 
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out one of the data base assertions, but demand some logical 
Inference to produce an answer. 

One of the earliest limited logic programs was SIR <Raphael 
196ii> # which could answer questions using simple logical 
relations (like the "part" example In the previous paragraph). 
The complex Information was not expressed as data, but was built 
directly Into the SIR operating program. This meant that the 
types of complex Information It could use were highly limited, 
and could not be easily changed or expanded. The complex 
questions It could answer were similar to those In many later 
limited logic systems, consisting of four basic types. The 
simplest Is a question which translates Into a single assertion 
to be verified or falsified (e.g. "Is John a bagel?") The second 
Is an assertion In which one part Is left undetermined (e.g. 
"Who Is a bagel?") and the system responds by "filling In the 
blank". The third type Is an extension of this, which asks for 
all possible blank-fillers (e.g. "Name all bagels."), and the 
fourth adds counting to this listing facility to answer count 
questions (e.g. "How many bagels are there?"). SIR had special 
logic for answering "how many" questions, using Information like 
"A hand has 5 fingers.", and in a similar way each limited logic 
system had special built-in mechanisms to answer certain types 
of questions . 

The DEACON system <Thompson> had special "verb tables" to 
handle time questions, and a bottom-up analysis method which 
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allowed Questions to be nested. For example, the question "who 
Is the commander of the batalllon at Fort Fubar?" was handled by 
first Internally answering the question "What batalllon is at 
Fort Fubar?" The answer was then substituted directly Into the 
original question to make It "Who Is the commander of the 69th 
batalllon?", which the system then answered. PROTOSYNTHEX II 
<Slmmons 1968> had special logic for taking advantage of the 
transitivity of "Is" (e.g. "A boy Is a person.", "A person Is an 
animal." therefore "A..."). PROTOSYNTHEX ||| <SImroons 1969> and 
SAMENLAQ II <ShapIro> bootstrapped their way out of first-order 
logic by allowing simple assertions about relationships (e.g. 
"North-of Is the converse of South-of."). CONVERSE <Kellogg> 
converted questions Into a "query language" which allowed the 
form of the question to be more complex but used simple table 
lookup for finding the answers. 

All of the limited logic systems are basically similar. In 
that complex Information Is not part of the data, but Is built 
Into the system programs. Those systems which could add to 
their Initial data base by accepting English sentences could 
accept only simple assertions as Input. The questions could not 
Involve complex quantified relationships (e.g. "Is there a 
country which Is smaller than every U.S. state?). 
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3.2.1* General DgflucUve System? 

The problems of limited logic systems were recognized very 
early (see <Raphael 196U> p. 90), and people looked for a more 
general approach to storing and using complex Information. If 
the knowledge could be expressed In some standard mathematical 
notation (such as the predicate calculus), then all of the work 
logicians have done on theorem proving could be utilized to make 
an efficient deductive system. By expressing a question as a 
theorem to be proved (see section 3.1.3), the theorem prover 
could actually deduce the Information needed to answer any 
question which could be expressed In the formalism. Complex 
Information not easily useable In limited logic systems could be 
neatly expressed In the predicate calculus, and a body of work 
already existed on computer theorem proving. This led to the 
"general deductive" approach to language understanding programs. 
The early programs used logical systems less powerful than 
the full predicate calculus (see <Bar-Hlllel>, <Coles 196B>, and 
<Darl lngton>) but the big boost to theorem proving research was 
the development of the Robinson resolution algorithm <Roblnson>, 
a very simple "complete uniform proof procedure" for the first 
order predicate calculus. This meant that It became easy to 
write an automatic theorem proving program with two Important 
characteristics. First, the procedure Is "uniform" -- we need 
not (and In fact, cannot) tell It how to go about proving things 
In a way suited to particular subject matter. It has Its own 
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fixed procedure for building proofs, and we can only change the 
sets of logical statements (or "axioms") for It to work on. 
Second, It guarantees that If any proof Is possible using the 
rules of predicate calculus, the procedure will eventually find 
It (even though It may take a very long time). These are very 
pretty properties for an abstract deductive system, but the 
question we must ask Is whether their theoretical beauty Is 
worth paying the price of low practicality. We would like to 
argue that In fact they have led to the worst deficiencies of 
the theorem-proving question-answerers, and that a very 
dl f ferent approach Is cal led for. 

The "uniform procedure" approach was adopted by a number of 
systems (see <Green 1968, 1969>) as an alternative to the kind 
of specialized limited logic discussed In the previous section. 
It was felt that there must be a way to present complex 
Information as data rather than embedding It Into the Inner 
workings of the language understanding system. There ere many 
benefits In having a uniform notation for representing problems 
and knowledge In a way which does not depend on the quirks of 
the particular program which will Interpret them. It enables a 
user to describe a body of knowledge to the computer In a 
"neutral" way without knowing the details of the question- 
answering system, and guarantees that the system will be 
applicable to any subject, rather than being specialized to 
handl e onl y one. 
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Predicate calculus seemed to be a good uniform notation, 
but In fact It has a serious deficiency. By putting complex 
Information Into a "neutral" logical formula, these systems 
Ignored the fact that an Important part of a person's knowledge 
concerns how to go about figuring things out. Our heads don t 
contain neat sets of logical axioms from which we can deduce 
everything through a "proof procedure". Instead we have a large 
set of heuristics and procedures for solving problems at 
different levels of generality. Of course, there Is no reason 
why a computer should do things the way a person does, but In 
Ignoring this type of knowledge, programs run Into tremendous 
problems of efficiency. As soon as a "uniform procedure" 
theorem prover gets a large set of axioms (even well below the 
number needed for really understanding language). It becomes 
bogged down In searching for a proof, since there Is no easy way 
to guide Its search according to the subject matter. In 
addition, a proof which takes many steps (even If they are In a 
sequence which can be easily predicted by the nature of the 
theorem) may take Impossibly long since It Is very difficult to 
describe the correct proving procedure to the system. 

It Is possible to write theorems In a clever way In order 

to Implicitly guide the deduction process, and a recent paper 

<Green 1969> describes some of the problems In "techniques for 

"programming" In first-order logic". First order logic Is a 

declarative rather than Imperative language, and to get an 

Imperative effect (I.e. telling It how to go about doing 

something) takes a good deal of careful thought and clever 

trickery. 

It might be possible to add strategy Information to a 

predicate calculus theorem prover, but with current systems such 
as QA3, "To change strategies In the current version, the user 
must know about set-of-support and other program parameters such 
as level bound and term-depth bound. To radically change the 
strategy, the user presently has to know the LISP language and 
must be able to modify certain strategy sections of the 
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program." (<Green 1969> p. 236). In newer programs such as QAta, 
there will be a special strategy language to go along with the 
theorem-proving mechanisms. It will be Interesting to see how 
close these new strategy languages are to PLANNER, and whether 
there Is any advantage to be gained by putting them In a hybrid 
with a resolution-based system. As to the completeness 
argument, there are good reasons not to have a complete system - 
- these are discussed later In this section. 
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S.2.5 Procedural DfldUCtlYe Syiifijnfi 

The problem with the limited logic systems wasn't the fact 
that they expressed their complex Information In the form of 
programs or procedures. The problem was that these programs 
were organized In such a way that "...each change In a 
subprogram may affect more of the other subprograms. The 
structure grows more awkward and difficult to 
general ize. * .Final ly the system may become too unwieldy for 
further experimentation.' 1 KRaphael 196<i> p. 91). Nevertheless, 
It was necessary to build In more and more of these subprograms 
In order to accept new subject matter. 

What was needed was the development of new programming 
techniques so that systems could retain the capability of using 
procedural Information, but at the same time express this 
Information In a simple and straightforward way which did not 
depend on the peculiarities and special strucuture of a 
particular program or subject of discussion. 

A system which partially fits this description Is Woods' 
<Vloods 1968>. It uses a quant I flcat tonal query language for 
expressing questions, then assumes that there are "semantic 
primitives" In the form of LISP subroutines which decide such 
predicates as (CONNECT FLIGHT-23 BOSTON CHICAGO) and which 
evaluate functions such as "number of stops", "owner", etc. The 
thing which makes this system different from the limited logic 
systems Is that the entire system was designed without reference 
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to the way the particular "prlml t I ve" functions would operate on 
the data base. In a way, this Is avoiding the Issue, since the 
Information which the system was designed to handle (the 
Official Airline Guide) Is particularly amenable to simple 
table-lookup routines. If we had to handle less structured 
Information of the type usually done with theorem provers, these 
primitive routines might indeed run Into the same problems of 
Interconnectedness described in the quote above, and would 
become harder and harder to generalize- 

PLANNER was designed by Carl Hewitt as a goal -or lenced 
procedural language to deal with these problems. It has special 
mechanisms for dealing with assertions In an efficient way, and 
In addition has the capabllty to Include any complex Information 
which can be expressed in the predicate calculus. More 
important, the complex information Is expressed In the form of 
procedures, which can Include all sorts of knowledge of how to 
best go about proving things. The language is "goal -oriented". 
In that we do not have to be concerned about the details of 
Interaction between the different procedures. If at different 
places In our knowledge we have theorems which ask whether an 
object Is sturdy (for example In a theorem about support, about 
building houses, etc.) they are not forced to specify the 
program which will serve as sturdlness-Inspector. Instead they 
say something like M Try to find an assertion that X Is sturdy, 
or prove It using anything you can, 11 If we know of special 
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procedures which seem most likely to give a quick answer, we can 

specify that these should be trTed first. But If at some point 

we add a new sturdlness-tester, we do not need to find out which 

theorems use It. We need only add It to the data base, and the 

system will automatically try It (along with any other 

sturdf ness-tes ters) whenever any theorem Rives the go-ahead. 

The ability to add new theorems without relating them to 
other theorems Is the advantage of a "uniform" notation. In 
fact PLANNER Is a uniform notation for expressing procedural 
knowledge just as predicate calculus Is a notation for a more 
limited range of Information. The advantage Is that PLANNEP has 
a hierarchical control structure. In addition to specifying 
logical relationships, a theorem can take over control of the 
deduction process. 

We can have complete control over how the system will 
operate. In any theorem, we can tell It to try to prove a 
subgoal using only certain theorems (If we know that the goal Is 
bound to fall unless one of them works), we can tell It to try 
things In a certain order (and the choice of this order can 
depend on arbitrarily complex calculations which take place when 
the subgoal Is set up) or we can even write a "spoiler" theorem, 
which can tell the system that a goal Is certain to fall, and 
that no other theorems should even be tried. 

Notice that this control structure makes It very difficult 
to characterize the abstract logical properties of PLANNER, such 
as consistency and completeness* It Is worth pointing out here 
that completeness may In fact be a bad property. It means (we 
believe, necessarily) that If the theorem-prover Is given 
something to prove which Is In fact false. It will exhaust every 
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possible way of trying to prove It, By forsaking completeness, 
we allow ourselves to use good sense In deciding when to give 
up. 

In a truly uniform system, the theorem prover Is forced to 
"rediscover the world" every time It answers a question. Every 
goal forces It to start from scratch, looking at all of the 
theorems In the data base (perhaps using some subject-matter- 
free heuristics to make a rough selection). Because It does not 
want to be limited to domain-dependent Information, It cannot 
use It at all, PLANNER can operate In this "bllndman" mode If 
we ask It to (and It Is less efficient at doing so than a 
procedure specially Invented to operate this way), but It should 
have to do this only rarely -- when discovering something which 
was not known or understood when the basic theorems were 
written. The rest of the time It can go about proving things 
which It knows how to do, without a tremendous overhead of 
having to piece together a proof from scratch each time. As 
mentioned above. It might be possible to patch "strategy 
programs" onto theorems In conventional theorem-provers In order 
to accomplish the same Koal , In PLANNER we have the advantage 
that this can be done naturally using the notation, and the 
strategy Is embedded In the PLANNER theorems, which themselves 
can be looked at as data. In an advanced system a PLANNER 
program could be written to learn from experience. Once the 
"bllndman mode" finds a proof, the method It used could be 
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remembered and tried first when a similar goal Is generated 

again. See section 5.1 for more discussion of learning. 

To those accustomed to uniform proof procedures, this all 

sounds like cheating. Is the system really proving anything If 

you are giving It clues about what to do? Why Is It different 

from a simple set of programmed LISP procedures like those 

envisioned by Woods? First, the language is designed so that 

theorems can be written Independently of each other, without 

worrying about when they will be called, or what other theorems 

and data they will need to prove their subgoals. 

The language Is designed so that If we want, we can write 
theorems In a form which Is almost Identical to the predicate 
calculus, so we have the benefits of a uniform system. On the 
other hand, we have the capability to add as much subject- 
dependent knowledge as we want, telling theorems about other 
theorems and proof procedures. The system has an automatic 
goal-tree backup system, so that even when we are specifying a 
particular order In which to do things, we rray not know how the 
system will go about doing them. It will be able to follow our 
suggestions and try many different theorems to establish a goal, 
backing up and trying another automatically If one of them leads 
to a failure (see section 3.3). 

In summary, the main advance In a deductive system using 

PLANNER Is In allowing ourselves to have a data base of 

procedures rather than formulas to express complex Information. 

This combines the generality and power of a theorem prover with 

the ability to accept procedural knowledge and heuristics 

relevant to the data. It provides a flexible and powerful tool 

to serve as the basis for a language understanding system. The 

rest of this chapter describes the PLANNER language and the way 

It Is used in our system. 
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3*3 Programming In PLANNER 
?■?■! fiASl£ Operation flf PLANNER 

The easiest way to understand PLANNER Is to watch how It 
works, so In this section we will present a few simple examples 
and explain the use of some of Its most elementary features. 

First we will take the most venerable of traditional 
deductions: 

Turing Is a human 
All humans are fallible 
so 

Turing Is fallible. 

It Is easy enough to see how this could be expressed In the 
usual logical notation and handled by a uniform proof procedure. 
Instead, let us express It In one possible way to PLANNER by 
saying: 

(THASSERT (HUMAN TURING)) 

;Thls asserts that Turing Is human, 
(DEFPROP THE0REM1 

(THCONSE (X) (FALLIBLE J?X) 

(THGOAL (HUMAN $?X))> 
THEOREM) 

;Thls Is one way of saying that all humans 
;are fallible. 

The proof would be generated by asking PLANNER to evaluate 

the expression: 

(THGOAL (FALLIBLE TURING) (THTBF THTRUE)) 
We Immediately see several points. First, there are two 
different ways of storing Information. Simple assertions are 
stored In a data base of assertions, while more complex 
sentences containing quantifiers or logical connectives are 
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expressed In the form of theorems. 

Second, one of the most Important points about PLANNER Is 
that It Is an evaluator for statements written In a programming 
language. It accepts Input In the form of expressions written 
In the PLANNER language, and evaluates them, producing a value 
and side effects. THASSERT Is a function which, when evaluated, 
stores Its argument In the data base of assertions or the data 
base of theorems {which are cross-referenced In various ways to 
give the system efficient look-up capabilities). A theorem Is 
defined with OEFPROP as are functions In LISP. 

In this example we have defined a theorem of the THCONSE 
type (THCONSE means consequent; we will see other types later). 
This states that If we ever want to establish a goal of the form 
(FALLIBLE $?X), we can do this by accomplishing the goal (HUMAN 
$?X), where X Is a variable. The strange prefix characters are 
part of PLANNER'S pattern matching capabilities. If we ask 
PLANNER to prove a goal of the form <A X), there Is no obvious 
way of knowing whether A and X are constants (like TURING and 
HUMAN In the example) or variables. LISP solves this problem by 
using the function QUOTE to Indicate constants. In pattern 
matching this Is Inconvenient and makes most patterns much 
bulkier and more difficult to read. Instead, PLANNER uses the 
opposite convention -- a constant Is represented by the atom 
Itself, while a variable must be Indicated by adding an 
appropriate prefix. This prefix differs according to the exact 
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use of the variable In the pattern, but for the time being let 
us just accept $? as a prefix Indicating a variable. The 
definition of the theorem Indicates that It has one variable, X, 
by the (X) following THCONSE. 

The third statement Illustrates the function THGOAL, v/hlch 
calls the PLANNER Interpreter to try to prove an assertion. 
This can function In several ways. If we had asked PLANNER to 
evaluate (THGOAL (HUMAN TURING)) It would have found the 
requested assertion Immediately In the data base and succeeded 
(returning as Its value some Indicator that It had succeeded). 
However, (FALLIBLE TURING) has not been asserted, so we must 
resort to theorems to prove It. 

Later we will see that a THGOAL statement can give PLANNER 
various kinds of advice on which theorems are applicable to the 
goal and should be tried. For the moment, (THTBF THTRUE) Is 
advice that causes the evaluator to try all theorems whose 
consequent Is of a form which matches the goal, (I.e. a theorem 
with a consequent (S?Z TURING) would be tried, but one of the 
form (HAPPY $?Z) or (FALLIBLE $?Y $?Z) would not. Assertions 
can have an arbitrary list structure for their format •- they 
are not limited to two-member lists or three-member lists as In 
these examples.) The theorem we have just defined would be 
found, and In trying It, the match of the consequent to the 
goal would cause the variable 1?X to be assigned to the constant 
TURING. Therefore, the theorem sets up a new goal (HUMAN 
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TURING) and this succeeds Immediately since It Is In the data 
base. In general/ the success of a theorem will depend on 
evaluating a PLANNER program of arbitrary complexity. In this 
case It contains only a single THGOAL statement, so Its success 
causes the entire theorem to succeed, and the goal (FALLIBLE 

TURING) Is proved. 

Consider the question "Is anything fallible?", or In 

logic, {EXISTS (Y)tFALLIBLE Y)>, This requires a variable and 

It could be expressed In PLANNER as: 

(THPROG CY) (THGOAL (FALLIBLE J?Y)(THTBF THTRUE))) 

Notice that THPROG (PLANNER'S equivalent of a LISP PROG, 

complete with GO statements, tags, RETURN, etc.) acts as an 

existential quantifier. It provides a binding-place for the 

variable Y, but does not Initialize It -- It leaves It In a 

state particularly marked as unasslgned. To answer the 

question, we ask PLANNER to evaluate the entire THPROG 

expression above. To do this It starts by evaluating the THGOAL 

expression. This searches the data base for an assertion of the 

form (FALLIBLE $?Y) and falls. It then looks for a theorem with 

a consequent of that form, since the recommendation (THTBF 

THTRUE) says to look at all possible theorems which might be 

applicable. When the theorem defined above Is called, the 

variable X In the theorem Is Identified with the variable Y In 

the goal, but since Y has no value yet, X does not receive a 

value. The theorem then sets up the goal (HUMAN $?X) with X as 
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a variable. The data-base searching mechanism takes this as a 
command to look for any assertion which matches that pattern 
(I.e. an Instant iat Ion), and finds the assertion (HUMAN TURING). 
This causes X (and therefore Y) to be assigned to the constant 
TURING, and the theorem succeeds, completing the proof and 
returning the value (FALLIBLE TURING). 
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S.3.2 PacKmp 

There seems to be something missing. So far, the data base 

has contained only the relevant objects, and therefore PLANNER 

has found the right assertions Immediately. Consider the 

problem we would get If we added new Information by evaluating 

the statements: 

(THASSERT (HUMAN SOCRATES)) 
(THASSERT (GREEK SOCRATES)) 

Our data base now contains the assertions: 

(HUMAN TURING) 
(HUMAN SOCRATES) 
(GREEK S0CRATE5) 

and the theorem: 

(THCONSE (X) (FALLIBLE $?X) 

(THGOAL (HUMAN $?X))) 

What If we now ask, "Is there a fallible Greek7" In PLANNER 

we would do this by evaluating the expression: 

(THPROC (X) (THGOAL (FALLIBLE $?X)(THTBF THTRUE)) 
(THGOAL (GREEK S?X))) 

THPROG acts like an AND, Insisting that all of Its terms 

are satisfied before the THPROG Is happy. Notice what might 

happen. The first THGOAL may be satisfied by the exact same 

deduction as before, since we have not removed Information. If 

the data-base searcher happens to run Into TURING before It 

finds SOCRATES, the goal (HUMAN $?X) will succeed, assigning $?X 

to TURING. After (FALLIBLE $?X) succeds, the THPROG will then 

establish the new goal (GREEK TURING), which Is doomed to fall 

since It has not been asserted, and there are no applicable 
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theorems. If we think In LISP terms, this Is a serious problem, 
since the evaluation of the first THGOAL has been completed 
before the second one Is called, and the "push-down list" now 
contains only the THPROG, If we try to go back to the beginning 
and start over. It will again find TURING 3nd so on, ad 
Infinitum, 

One of the most Important features of the PLANNER language 
Is that backup In case of failure Is always possible, and 
moreover this backup can go to the last place where a decision 
of any sort was made. Here, the decision was to pick a 
particular assertion from the data base to match a goal. Other 

decisions might be the choice of a theorem to satisfy a goal, or 
a decision of other types found In more complex PLANNER 
functions such as THOR (the equivalent of LISP OR). PLANNER 
keeps enough Information to change any decision and send 
evaluation back down a new path. 

In our example the decision was made Inside the theorem for 
FALLIBLE, when the goal (HUMAN $?X) was matched to the assertion 
(HUMAN TURING), PLANNER will retrace Its steps, try to find a 
different assertion which matches the goal, find (HUMAN 
SOCRATES), and continue with the proof. The theorem will 
succeed with the value (FALLIBLE SOCRATES), and the THPROG will 
proceed to the next expression, (THGOAL (GREEK $?X)). Since X 
has been assigned to SOCRATES, this will set up the goal (GREEK 
SOCRATES) which will succeed Immediately by finding the 
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corresponding assertion In the data base. Since there are no 
more expressions in the THPROG, It will succeed, returning as 
Its value the value of the last expression, (RREEK SOCRATES). 
The whole course of the deduction process depends on the failure 
mechanism for backing up and trying things over (this Is 
actually the process of trying different branches down the 
subgoal tree.) All of the functions like THCOND, THAND, THOR, 
etc. are controlled by success vs. failure. Thus It Is the 
PLANNER executive which establishes and manipulates subgoals In 
looking for a proof. 
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1*1*1 Differences with Other Thepren-Prgygrs ^ ^ nfig a^e^ 

Although PLANNER Is written as a programming language. It 

differs In several critical ways from anything which Is normally 

considered a programming language. First, It Is goal -dl rected. 

Theorems can be thought of as subroutines, but they can be 

called through a very general pattern-matcher which looks at the 

goal which Is to be satisfied. This Is like having the abllltly 

to say "Call a subroutine which will achieve the desired result 

at this point." Second, the evaluator has the mechanism of 

success and failure to handle the exploration of the subgoal 

tree. Other languages, such as LISP, with a basic recurs 1 ve 

evaluator have no way to do this. Third, PLANNER contains a 

bookkeeping system for matching patterns and manloulatlng a data 

base, and for handl lng that data base efficiently. 

How Is PLANNER different from a theorem prover? What Is 
gained by writing theorems In the form of programs, and giving 
them power to call other programs which manipulate data? The 
key Is In the form of the data the theorem-prover can accept. 
Host systems take declarative Information, as In predicate 
calculus. This Is In the form of expressions which represent 
"facts" about the world. These &re manipulated by the theorem- 
prover according to some fixed uniform process set by the 
system. PLANNER can make use of Imperative Information, 
telling It how to go about proving a subgoal, or to make use of 
an assertion. This produces what Is called hlerarchl cal control 
structure . That Is, any theorem can Indicate what the theorem 
prover Is supposed to do as It continues the proof. It has the 
full power of a general programming language to evaluate 
functions which can depend on both the data base and the subgoal 
tree, and to use Its results to control the further proof by 
making assertions, deciding what theorems are to be used, and 
specifying a sequence of steps to be followed. 

What does this mean In practical terms? In what way does 

It make a "better" theorem prover? We will give several 
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examples of areas where the approach is Important. 

First, consider the basic problem of deciding what subgoals 
to try In attempting to satisfy a goat. Very often, knowledge 
of the subject matter will tell us that certain methods are very 
likely to succeed, others may be useful If certain other 
conditions are present, while others may be possibly valuable, 
but not likely. We would like to have the ability to use 
heuristic programs to determine these facts and direct the 
theorem prover accordingly. It should be able to direct the 
search for goals and solutions In the best way possible, and 
able to bring as much Intelligence as possible to bear on the 
decision. In PLANNER this Is done by adding to our THGOAL 
statement a reconvnendat Ion 1 1st which can specify that ONLY 
certain theorems are to be tried, or that certain ones are to be 
tried FIRST In a specified order. Since theorems are programs, 
subroutines of any type can be called to help make this decision 
before establishing a new THGOAL. Each theorem has a name (In 
our definition at the beginning of Section 3.1,1, the theorem 
was given the name THEOREM!)* to facilitate referring to It 
expl Id tly. 

The simplest kind of recommendation Is THU5E, which takes a 
list of theorems (by names) and recommends that they be tried In 
the order listed. A more general recommendation uses f 1 1 ters 
which look at the theorem and decide whether It should be tried. 
The user defines his own f 1 1 ters, except for the standard f 1 1 ter 
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THTRUE, which accepts any theorem. 

The filter command for theorems Is THT8F, so a 
recofmiendat Ion 1 1st of the form: 

(CTHUSE TH1 TH2KTHTBF TESTXTHUSE TH-DESPERATION)} 
would mean to first try the theorem named TH1, then TH2, then 
any theorem which passes the filter named TEST (which the user 
would define), then If all that falls, use the theorem named TH- 
DESPERATION. In our programs, we have made use of only the 
simple capabilities for choosing theorems -- we do not define 
filters other than THTRUE. However, there Is also a capability 
for filtering assertions In a similar way, and we do use this, 
as explained In section (4.3, 
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3-5.1* Control 1 Ing xM Data Base 

An Important problem Is that of maintaining a data base 
with a reasonable amount of material. Consider the first 
example above* The statement that all humans are fallible/ 
while unambiguous In a declarative sense Is actually ambiguous 
In Its Imperative sense (I.e. the way It Is to be used by the 
theorem prover). The first way Is to simply use It whenever we 
are faced with the need to prove (FALLIBLE $?X). Another way 
might be to watch for a statement of the form (HUMAN $?X) to be 
asserted, and to Immediately assert (FALLIBLE S7X) as well. 
There Is no abstract logical difference, but the Impact on the 
data base Is tremendous. The more conclusions we draw when 
Information Is asserted, the easier proofs will be, since they 
will not have to make the additional steps to deduce these 
consequences over and over again. However since we don't have 
Infinite speed and size. It Is clearly folly to think of 
deducing and asserting everything possible (or even everything 
Interesting) about the data when it Is entered. If we were 
working with totally abstract meaningless theorems and axioms 
(an assumption which would not be Incompatible with many 
theorem-proving schemes), this would be an Insoluble dilemma. 
But PLANNER Is designed to work In the real world, where our 
knowledge Is much more structured than a set of axioms and rules 
of Inference. We may very well, when we assert (LIKES S?X 
POETRY) want to deduce and assert (HUMAN $?X), since In 
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deducing things about an object. It will very often be relevant 

whether that object Is human, and we shouldn't need to deduce It 

each time. On the other hand, ft would be silly to assert (HAS- 

AS-PART S?X SPLEEN), since there Is a horde of facts equally 

Important and equally limited In use. Part of the knowledge 

which PLANNER should have of a subject, then. Is what facts are 

Important, and when to draw consequences of an assertion. This 

Is done by having theorems of an antecedent type: 

(DEFPROP THE0REM2 

(THANTE (X Y) (LIKES S?X $7Y) 

(THASSERT (HUMAN J?X))> 
THEOREM) 

This says that when we assert that X likes something, we 
should also assert (HUMAN J?X). Of course, such theorems do not 
have to be so simple. A fully general PLANNER program can be 
activated by an THANTE theorem, doing an arbitrary (that Is, the 
programmer has free choice) amount of deduction, asssertlon, 
etc. Knowledge of what we are doing In a particular problem may 
Indicate that It Is sometimes a good Idea to do this kind of 
deduction, and other times not. As with the CONSEQUENT 
theorems, PLANNER has the full capacity when something Is 
asserted, to evaluate the current state of the data and proof, 
and specifically decide which ANTECEDENT theorems should be 
called. 

PLANNER therefore a I lows deductions to use all sorts of 
knowledge about the subject matter which go far beyond the set 
of axioms and basic deductive rules, PLANNER Itself Is subject- 
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Independent, but Its power Is that the deduct Ion process s never 
needs to operate on such a level of Ignorance, The programmer 
can put In as much heuristic knowledge as he wants to about the 
subject, just as a good teacher would help a class to understand 
a mathematical theory, rather than just telling them the axioms 
and then giving theorems to prove. 
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1*1*1 Eygnts sad States 

Another advantage In representing knowledge In an 
Imperative form Is the use of a theorem prover In dealing with 
processes Involving a sequence of events. Consider the case of 
a robot manipulating blocks on a table. It might have data of 
the form, "blockl Is on biock2," "block2 Is behind blocks", and 
"If x Is on y and you put It on z, then x Is on z, and Is no 
longer on y unless y Is the sane as z"« Many examples In papers 
on theorem provers are of this form (for example the classic 
"monkey and bananas" problem). The problem Is that a 
declarative theorem prover cannot accept a statement like (ON Bl 
B2) at face value. It clearly Is not an axiom of the system, 
since Its validity will change as the process does on. It must 
be put In a form (ON Bl 82 SO) where SO Is a symbol for an 
Initial state of the world. See <Green 1969> for a discussion 
of such "state" problems. 

The third statement might be expressed as: 

(FORALL <X Y Z SXAND (ON X Y (PUT X Y S>) 

(ORtEQUAl Y Z> 

(NOTION X Z (PUT X Y S)))))) 

In this representation, PUT Is a function whose value Is 

the state which results from putting X on Y when the previous 

state was S. We run Into a problem when we try to ask (ON Z W 

(PUT X Y S>) I.e. Is block Z on block W after we put X on Y? A 

human knows that If we haven't touched Z or W we could just ask 

(ON Z W S) but In general It may take a complex deduction to 
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decide whether we have actually moved them, and even If we 

haven't* It will take a whole chain of deductions (tracing back 

through the time sequence) to prove they haven't been moved. In 

PLANNER/ where we specify a process directly, this whole type of 

problem can be handled In an Intuitively more satisfactory way 

by using the primitive function THERASE. 

Evaluating (THERASE (ON $7X S?Y)) removes the assertion (ON 

S?X $7Y) from the data base. If we think of theorem provers as 

working with a set of axioms. It seems strange to have a 

function whose purpose Is to erase axioms. If Instead we think 

of the data base as the "state of the world 11 and the operation 

of the prover as manipulating that state. It allows us to make 

great simplifications. Now we can simply assert (ON Bl B2) 

without any explicit mention of states. We can express the 

necessary theorem as: 

(DEFPROP THE0REM3 

(THCONSE (X Y Z) (PUT S?X $?Y) 
(THGOAL (ON S?X S?Z>) 
(THERASE (ON $?X $7Z)J 
(THASSERT (ON $?X *?Y))) 
THEOREM) 

This says that whenever we want to satisfy a goal of the 

form (PUT $?X $?Y), we should first find out what thlnR Z the 

thing X Is sitting on, erase the fact that It Is sitting on Z, 

and assert that It Is sitting on Y. We could also do a number of 

other things, such as proving that It Is Indeed possible to put 

X on Y, or adding a list of specific Instructions to a movement 

plan for an arm to actually execute the goal, In a more complex 



Section 3,3.5 - Pa 8 e 255 



case/ other Interactions might be Involved. For example. If we 
are keeping assertions of the form (ABOVE $?X STY) we would need 
to delete those assertions which became false when we erased (ON 
5?X $?2) and add those which became true when we added (ON $?X 
5?Y), ANTECEDENT theorems would be called by the assertion (ON 
$7X $?Y) to take care of that part/ and a similar group called 
ERASING theorems can be called In an exactly analogous way when 
an assertion Is erased, to derive consequences of the erasure. 
Again we emphasize that which of such theorems would be called 
Is dependent on the way the data base Is structured, and Is 
determined by knowledge of the subject matter. In this example, 
we would have to decide whether It was worth adding all of the 
ABOVE relations to the data base, v/lth the resultant need to 
check them whenever something Is moved, or Instead to omit them 
and take time to deduce them from the ON relation each time they 
are needed. 

Thus In PLANNER, the changing state of the world can be 
mirrored In the changing state of the data base, avoiding any 
need to make explicit mention of states, with the requisite 
overhead of deductions. This Is possible since the Information 
Is given In an Imperative form, specifying theorems as a series 
of specific steps to be executed. 

If we look back to the distinction between assertions and 
theorems made on the first page. It would seem that we have 
established that the base of assertions Is the "current state of 
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the world", while the base of theorems Is our permanent 
knowledge of how to deduce things from that state* This Is not 
exactly true, and one of the most exciting possibilities In 
PLANNER Is the capability for the program Itself to create and 
modify the PLANNER functions whTch make up the theorem base. 
Rather than simply making assertions, a particular PLANNER 
function might be written to put together a new theorem or make 
changes to an existing theorem. In a way dependent on the data 
and current knowledge. It seens likely that meaningful 
"learning 11 Involves this type of behavior rather than simply 
modifying parameters or adding more Individual facts 
(assertions) to a declarative data base. 
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3-3.6 PLANNER Functions 

There are a number of other PLANNER commands/ designed to 
suit a range of problem-solving needs. They are described In 
detail In <Hewltt 1969, 1970> and We will describe only those 
which are of particular use In our question answering program 
and which we will want to refer to later. 

We have already mentioned the basic functions and described 
how they operate. THGOAL looks for assertions In the data base, 
and calls theorems to achieve goals. THAND takes a list of 
PLANNER expressions and succeeds only If they all succeed In the 
order they are listed. THOR takes a similar list and tries the 
expressions In order, but succeeds as soon as one of them does. 
Remember that In case of a failure farther along In the 
deduction, THOR can take back Its decision and continue on down 
the list. The other simple LISP functions PROG, COND, and NOT 
have their PLANNER analogs, THPROG, THCOND, and THNOT, which 
operate just as their LISP counterparts, except that they arc 
controlled by the distinction between "failure" and "success" 
instead of the distinction between NIL and non-NIL. THPROG acts 
like THANO, failing If any one of its members falls. 

One of the most useful PLANNER functions Is THFIND, which 
Is used to find all of the objects or assertions satisfying a 
given PLANNER condition. For example. If we want to find all of 
the red blocks, we can evaluate* 
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(THFIND ALL $?X (X) 

(THGOAHBLOCK $?X)) 
(THGOAL(COLOR S?X RED))) 

The function THFIND takes four pieces of Information. 
First, there Is a parameter, telling It how many objects to look 
for. When we use ALL, It looks for as many as It can find, and 
succeeds If It finds any. If we use an Integer, it succeeds as 
soon as It finds that many, without looking for more. If we 
want to be more complex, we can tell It three things: a. how 
many It needs to succeed; b. how many It needs to quit looking, 
and c. whether to succeed or fall If It reaches the upper limit 
set In b. 

Thus If we want to find exactly 3 objects, we can use a 
parameter of (3 ** NIL), which means "Don't succeed unless there 
are three, look for a fourth, but If you find It, fall". 

The second bit of Information tells It what we want In the 
list It returns. For our purposes, this will always be the 
variable name of the object we are Interested In, The third 
Item Is a list of variables to be used, and the fourth Is the 
body of the THFIND statement. It Is this body that must be 
satisfied for each object found. It Is Identical to the body of 
a THPROG, and can have tags and THGO statements as wall as a 
series of expressions to be evaluated. 

Another function used extensively by the semantic 
Interpreter is THAHONG. This takes two arguments, the second Is 
a list, and the first Is the name of a variable. If the 
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variable Is assigned, THAHONG acts just like LISP MEMQ, 
succeeding If the value of the variable Is contained In the 
list. However, If the variable Is unasslgned, THAMONG assigns 
It to the first member of the list, then succeeds. If this 
causes a failure to back up to the THAMONG, It binds the 
variable Instead to the second member and tries again. This 
continues until the entire expression succeeds with some 
assignment or the list Is exhausted. In which case THAMONG 
returns failure. Using this, alone with the normal binding 
mechanism In THGOAL statements, failure can be used to run a 
loop through a list of objects which are specified by giving a 
PLANNER goal or arbitrary expression which they satslfy. 
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5,4 The BLOCKS World 
We need a subject to discuss with our language- 
understanding program which gives a variety of things to say and 
In which we can carry on a discourse, containing statements, 
questions/ and commands. We have chosen to pretend we are 
talking to a very simple type of robot (like the ones being 
developed In Al projects at Stanford and MIT) with only one arm 
and an eye. It can look at a scene containing toy objects like 
blocks and balls, and can manipulate them with Its hand. 

We have not tried to use an actual robot or to simulate It 
In physical detail. Since we are Interested primarily In 
complex language activity, we have adopted a very simplified 
model of the world, and the "robot" exists only as a display on 
the CRT scope attached to the computer. 
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First we must decide what objects we will have In the 
world. In 3.1, we adopted some conventions for notation In 
representing objects and assertions. Any symbol which begins 
with ":" represents a specific object, while anything beginning 
with u i u Is the name of a property or relation. 

The model begins with the two participants In the 
discussion, the robot (named :SHROLU), and the person (called 
:FRIENDK The robot has a hand (:HAND), and manipulates objects 
on a table (:TABLE), which has on It a box (:BOX). The rest of 
the physical objects are toys ■• blocks, pyramids, and balls. 
We give them the names :B1, :B2, :B3,... 

Next we must decide on the set of concepts we will use to 
describe these objects and their properties. We can represent 
these In the form of a tree: 





'TABLE 






'BOX I'BLOCK 






'PHYSOB — 


'MANIP I'BALL 






'ROBOT 


'HAND ('PYRAMID 






'PERSON 
'PROPERTY- 


'STACK 

I'COLOR 

HSHAPE 




Figure 5i* — Classification of Objects and Properties 





The symbol 'PHYSOB stands for "physical object", and 'MANIP for 
"menlpulable object" (I.e. something the robot can pick up). 
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We could use these as simple predicates, and have 
assertions like (*R0B0T rSHRDLU), (*HAND :HAND>, and ('PYRAMID 
:B5) to say that Shrdlu Is a robot, the hand Is a hand, end tBS 
Is a pyramid. In section k.k.l, we describe the way the 
language programs choose an English phrase to describe an 
object. In order to do so, they need a basic noun -- the one we 
would use to say "this Is a ...". If we represented the 
concepts In the above tree using simple predicates, and then 
used the same form for other predicates, such as colors (for 
example^ (*BLUE :B5>), the language generating routines would 
have no easy way to know which was the "basic" property. It 
would be necessary to keep lists and continually check. 
Instead, we adopt a different way of writing these concepts. We 
use the concept #IS to mean "has as Its basic description", and 
write <*IS :SHRDLU IROBOT), (MS :HAND #HAND), and (#IS :B5 
IPYRAMID). 

Looking at the tree, we see that the properties *PHYS0B and 
#MANIP cannot be represented In this fashion, since any object 
having them also has a basic description. We therefore write 
<#MANIP :B5) and (#PHYSOB :TABLE). 

Next, we would like to assign physical properties to these 
objects, such as si ze, shape, color, and location. Shape and 
color are handled with simple assertions like (#COLOR :BOX 
WHITE) and (♦SHAPE :B5 ^POINTED). The possible snapes are 
#ROUND, 4P0INTED, AND #RECTANGULAR, and the colors are *BLACK, 
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IREO, #WHITE, *GREEN, and #BLUE. Of course It would Involve no 
programming to Introduce other shape or color names — all that 
we would do Is use them In an assertion, like ('COLOR :B11 
♦MAUVE), and add an assertion telling what type of thing they 
are. The property names themselves can be seen as objects, and 
we have the concepts ICOLOR and tSHAPE, to make assertions like 
MIS #BLUE *COLOR), and (#IS ♦RECTANGULAR *SHAPE) . 

Size and location are more complex, as they depend on the 
way we choose to represent physical space. We have adopted a 
standard three-dimensional coordinate system, wl th coord I nates 
rangln£ from to 1200 In all three directions, (The number 
1200 was chosen for convenience In programming the display). 
The coordinate point (0 0) Is In the front lower left-hand 
corner of the scene. 

We have made the simplifying assumption that objects are 
not allowed to rotate, and therefore always keep their 
orientation aligned with the coordinate axes. We can represent 
the position of an object by giving the coordinates of Its front 
lower left-hand corner, and can specify Its size by giving the 
three dimensions. We use the symbols 'SIZE and #AT, and put the 
coordinate triples as a single element In the assertions. For 
example, we might have CAT I B5 < 400 600 200)), and ('SIZE :B5 
(100 100 300)K 

Since we assume that the robot has an eye, the system 
begins the dialog with complete Information about the objects In 
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the scene, their shapes, sizes, colors, and locations. In 
addition to the PLANNER assertions, the system keeps a table of 
sizes and locations for more efficient calculation when looking 
for an empty space to set something down. 
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3.li-2 Relations 

The basic relations we will need for this model are the 
spatial relations between objects. Since we are Interested In 
moving objects around In the scene, one of the most Important 
relations Is 'SUPPORT. The Initial data base contains all of 
the applicable support relations for the Initial scene, and 
every time an object Is moved, an antecedent theorem removes the 
old assertion about what was supporting It, and puts In the 
correct new one* We have adopted a very simplified notion of 
support. In which an object Is supported by whatever Is directly 
below Its center of gravity, at the level of Its bottom face. 
Therefore, an object can support several others, but there Is 
only one thing supporting It, Of course this Is an extreme 
simplification since It does not recognize that a simple bridge 
Is supported. If this program were to be adapted to use with an 
actual robot/ a much more general Idea of support would be 
necessary. Along with the #SUPP0RT relations, we keep track of 
the property #CLEARTOP. The assertion <#CLEARTOP X) will be In 
the data base If and only If there Is no assertion ('SUPPORT X 
Y) for any object Y, It Is also kept current by antecedent 
theorems which are called whenever an object Is moved. This 
happens automatically whenever an assertion of the form (#AT OBJ 
(X Y Z)) Is made. The theorems make the appropriate check to 
see whether the 'CLEARTOP status of any object has changed, and 
If so the necessary erasures and assertions are made. 
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A second relation which Is kept In the data base Is 
'CONTAIN. The first participant must be the box, since this Is 
the only container In the scene. The Information about what Is 
contained In the box Is also kept current by an antecedent 
theorem. The relation 'GRASPING Is used to Indicate what 
object (If any) the robot's hand Is grasping. It Is 
theoretically a two-place predicate, relating a grasper and a 
graspee, as In ('GRASPING :SHRDLU :B2). Since there Is only one 
hand In our scene. It Is clear who must be doing the grasping, 
so the assertion Is reduced to ('GRASPING :B2). 

The other relation which Is stored In the data base Is the 
'PART relation between an object and a stack. We can give a 
name to a stack/ such as :S1, and assert ('PART :B2 :S1). As 
objects are moved, the changes to the data base are again made 
automatically by antecedent theorems which notice changes of 
location. 

As we explained In section 3*3*3* we must decide what 
relations are useful enough to occupy space In our data base, 
and which should be recomputed from simpler Information each 
time we need them. He have Included relations like 'SUPPORT and 
'CONTAIN because they are often referenced In deciding how to 
move objects. We can think of other relations, such as the 
relative position of two objects, which can be computed from 
their locations, and are not used often enough to be worth 
keeping In the data base and partially recomputing every time 
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something ts moved. We represent these relations using the 
symbols 'RIGHT, 'BEHIND, and 'ABOVE. (These represent the 
direction of the positive coordinate axis for X, Y, and Z 
respectively). We do not need the converse relations, since we 
can represent a fact like n :Bl Is below :B2" by ('ABOVE :B2 
:B1), and our semantic system can convert what Is said to this 
standard format. The symbol 'ON Is used to represent the 
transitive closure of 'SUPPORT. That Is, Z Is 'ON A If A 
supports B, B supports C, . . .supports Z, 

The three spatial relations use a common consequent theorem 
called TC-LOC which decides If they are true by looking at the 

coordinates and sizes of the objects. The 'ON relation has a 
consequent theorem TC-ON which looks for chains of support. 
(Notice that the prefix TC- stands for Theorem Consequent, and 
Is attached to all of our consequent theorems. Similarly, TA- 
and TE- are used for antecedent and erasing theorems*) 

The measurements of #HEIGHT, 'WIDTH, and 'LENGTH are 
represented as a simple assertion, like ('HEIGHT :B5 100), but 
they are not stored In the data base. They are computed when 
needed from the 'SIZE assertion, and can be accessed by using 
the theorem TC-MEASURE, or by using a functional notation. The 
expression ('HEIGHT X) evaluates to the height of whatever 
object the variable X Is bound to. If 'SIZE Is used In this 
way. It returns a measure of "overall size" to be used for 
comparisons like "bigger". Currently It returns the sum of the 
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X, Y, and Z coordinates, but It could be easily changed to be 
more In accord with human psychology. 

In order to compare measurements, we have the relation 
#MORE. The sentence ":B1 Is shorter than :B2 M Is equivalent to 
the assertion (#MORE 'HEIGHT :B2 :B1). Again, we do not need 
the relation "less" since we can simply reverse the order of the 
objects. The relation JA5MUCH Is used In the same way, to 
express "greater than or equal". Instead of "strictly greater 
than". None of these assertions are stored (If we have ten 
objects, there will be almost t*O0 relationships), but are 
computed from more basic Information as they are needed. 

One final relationship Is 'LIKE, which relates a person or 
robot to any object. There Is a theorem which shows that the 
robot likes everything, but knowledge about what the human user 
likes Is gathered from his statements. The semantic programs 
can use statements about liking to generate further PLANNER 
theorems which are used to answer questions about what :FRIEND 
1 Ikes. 
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UU3 Actions 

The only events that can take place In our world are 
actions taken by the robot In moving Its hand and manipulating 
objects. At the most basic level, there are only three actions 
which can occur — MOVETO, GRASP, and UKGRASP. These are the 
actual commands sent to the display routines, and could 
theoretically be sent directly to a physical robot system. 

The result of calling a consequent theorem to achieve a 
goal requiring motion, like (#PUTON :B3 :BO, Is a plan -- a 
list of Instructions using the three elementary functions. 
MOVETO moves the hand and whatever It Is currently grasping to a 
set of specified coordinates, GRASP sets an Indicator that the 
grasped object Is to be moved along with the hand, and UNGRASP 
unsets It. The robot grasps by moving Its hand directly over 
the center of the object on Its top surface, and turning on a 
"magnet". It can do this to any manlpulable object, but can 
only grasp one thing at a time. Using these elementary actions, 
we can build a hierarchy of actions. Including goals which may 
Involve a whole sequence of deductions and actions, like 
#STACKUP. 

The semantic programs never need to worry about details 
Involving physical coordinates or specific motion Instructions, 
but can produce Input for higher-level theorems which do the 
detal led work. 

At a slightly higher level, we have the PLANNER concepts 
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fMQVEHAND, #GRASP and 4UNGRASP, and corresponding consequent 
theorems to achieve them. There Is a significant difference 
between these and the functions listed above. Calling tne 
function M0VETO actually causes the hand to move. On the other 
handy when PLANNER evaluates a statment I Ike: 

(THGOALUMOVEHAND (600 200 300))(THUSE TC-MOVEHAND)) 
nothing Is actually moved. The theorem TC-MOVEHAND Is called, 
and It creates a plan to do the motion, but If this move causes 
us to be unable to achieve a goal at some later point, thft 
PLANNER backup mechanism will automatlcaly erase It from the 
plan. The robot plans the entire action before actually moving 
anything, trying all of the means It has to achieve Its goal. 

The theorems also do some checking to see If we are trying 
to do something Impossible. For example, TC-H0VEHAN0 makes sure 
the action would not Involve placing a block where there Is 
already an object, and TOUNGRASP falls unless there Is 
something supporting the object It wants to let go of. 



Section 3.*».ii - Page 271 



i-Jui CtirrYins Gk£ Commands 

Some theorems, like TC-GRASP, are more complex, as they can 
cause a series of actions. In this section we will follow 
PLANNER through such an action, using the simplified theorems of 
figure 55. If PLANNER tries the goal: 

(THGOAL <#GRASP :81}(THUSE TC-GRASP)) 
the theorem TC-GRASP can do a number of things. It checks to 
make sure :B1 Is a graspable object by looking In the data base 
for UMANIP :B1), If the hand Is already grasping the object. 
It has nothing more to do. If not. It must first get the hand 
to the object. This may Involve complications -- the hand may 
already be holding something, or there may be objects sitting on 
top of the one It wants to grasp. In the first case. It must 
get rid of whatever Is In the hand, using the the command #GET- 
RID-OF. The easiest way to get rid of something Is to set It 
on the table, so TC-GET-R1D-0F creates the goal <#PUTON $?X 
:TABLE), where the variable $?X Is bound to the object the hand 
Is holding. TC-PUTON must In turn find a big enough empty place 
to set down Its burden, using the command *FINDSPACE, which 
performs the necessary calculations, using Information about the 
sizes and locations of all the objects. TC-PUTON then creates a 
goal using #PUT, which calculates where the hand must be moved 
to get the object Into the desrlred place, then calls #MOVEHAND 
to actually plan the move. If we look at the logical structure 
of our active goals at this point, assuming that we want to 
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(DEFTHEOREM TC-CLEARTOP 

(THCONSE (X Y) (KLEARTOP $?X) 

GO (THCONO ((THGOAL (#SUPPORT S?X $_Y ) ) 

(THGOAL (IGET-RID-OF $?Y> 

(THUSE TC-GET-RID-OF)) 

(THGO GO)) 
((THASSERT UCLEARTOP S?X)))))) 

(DEFTHEOREM TC-GET-RID-OF 

(THCONSE (X Y) UGET-RID-OF $?X) 
(THOR 
(THGOAL CPUTON $?X : TABLE) (THUSE TC-PUTON)) 
(THGOAL (#PUT0M S?X $?Y)(THUSE TC-PUTON))))) 

(DEFTHEOREM TC-GRASP 

(THCONSE (X Y) (#GRASP $?X) 

(THGOALUMANIP $?X)> 
(THCOND ((THGOAL UGRASPING $?X))) 
((THGOAL ('GRASPING $_Y)> 
(THGOAL UGET-RID-OF S?Y) 

(THUSE TC-GET-RID-OF)))) 

U)) 

(THGOAL UCLEARTOP S?X) (THUSE TC-CLEARTOP)) 
(THSETQ $ Y (TOPCENTER $?X)) 
(THGOAL UMOVEHAND *?Y) 

(THUSE TC-MOVEHAND)) 
(THASSERT (#GRASPING $?X)))) 

(DEFTHEOREM TC-PUT 

(THCONSE (X Y Z) <#PUT $?X $?Y) 

(CLEAR $?Y (SIZE $?X) *?X) 

(SUPPORT $?Y (SIZE $?X) $?X) 

(THGOAL UGRASP S7X) (THUSE TC-GRASP)) 

(THSETO $_Z (TCENT S?Y (SIZE J?X))) 

(THGOAL (#MOVEHAND $?Z) (THUSE TC-MOVEHAND) ) 

(THGOAL (#UNGRASP) (THUSE TC-UNGRASP) ) ) ) 

(DEFTHEOREM TC-PUTON 

(THCONSE (X Y Z) UPUTON $?X S?Y) 
(NOT (EQ $?X J?Y)) 
(THGOAL (IFINDSPACE I?Y JE (SIZE $?X) $?X $_Z) 

(THUSE TC-FINDSPACE TC-MAKESPACE) ) 
(THGOAL (#PUT $?X S?Z) (THUSE TC-PUT)))) 

Figure 55 — Simplified PLANNER Theorems 
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grasp :B1, but were already grasping :B2, we see: 

('GRASP :B1J 

('GET-RID-OF :B2) 

('PUTON :B2 :TABLE) 

('PUT :B2 ( l»53 201 0)) 

('MOVEHAND (553 301 100)} 

After moving, TC-PUT calls 'UNGRASP, and we have achieved 

the first part of our original eoal — emptying the hand. Now 

we must clear off the block we want to grasp. TC-GRASP sets up 

the goal: 

(THG0ALCCLEART0P :B2)(THUSE TC-CLEARTOP) ) 
This Is a good example of the double use of PLANNER goals to 
both search the data base and carry out actions. If the 
assertion UCLEARTOP :B1) Is present. It satisfies this goal 
Immediately without calling the theorem. However If :B1 Is not 
already clear, this THGOAL statement calls TC-CLEARTOP which 
takes the necessary actions. 

TC-CLEARTOP will try to 'GET-RID-OF the objects on top of 
:B1. This will In turn use 'PUTON, which uses 'PUT. But TC-PUT 
may have more to do this time, since the hand Is not already 
grasping the object It has to move. It therefore sets up a goal 
to 'GRASP the object, calling TC-GRASP. We have gone full 
circle, and are back In the theorem we started In. Of course 
this Is no problem In a recursive language, and the process 
continues as It should until all of the goals have been 
achieved, or all of the methods Included In the theorems have 
failed. 
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We have gone through this example In some detail to give a 
feeling for the goal-oriented programs used by PLANNER. The 
programs are highly recursive, with a clear subgoal structure, 
and with theorems often calling themselves to achieve subgoals. 
During all of this, PLANNER Is keeping track of what Is being 
done In such a way that It can back up and try something 
different If necessary. For example. If TC-GET-RID-OF puts an 
object on the table, and It later Is In the way of something 
which must be done, a failure will propagate back to TC-GET-RID- 
OF, and It will try putting It somewhere else. 

Figure 56 lists the different action concepts more 
systematically, showing the form of the goal statements, and the 
actions taken by the theorems corresponding to them. In our 
program, the names of the theorems are formed by adding the 
prefix TC- for a consequent theorm, TA- for antecedent, and TE- 
for erasing. This is strictly for programmer convenience In 
recognizing the purpose of a theorem from Its name. All of 
these goals can be used Internally within the BLOCKS system, and 
most of them can be called directly by the semantic programs, as 
direct translations of English commands. Some, like *MOVEHAND 
cannot be called by the linguistic programs, since the semantic 
routines do not include a way to specify exact coordinates In 
Engl Ish. 
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Command 



Effect 



(IMOVEMAND (X Y Z)> Hove the center of the hand to location 

(X Y Z). Anything being grasped goes 
along automatically. 



UUNGRASP) 

(#GRASP X) 

(*PUT W (X Y Z)) 

URAISEHAND) 

UPICKUP X) 
HPUTON X Y) 

CtPUTIN X Y) 

(#GET-RI0-0F X) 
(#CLEART0P X) 



Let go of whatever the hand Is holding. 
Falls If the object Is not supported. 

Grasp object X, doing any manipulations 
necessary to get to It. 

Put W at location (X Y Z). Falls unless 
the space Is empty. 

Raise the hand and whatever It holds as 
high as it will go. 

Grasp X and raise the hand. 

Put object X on object Y. If there Is 
not enough free space on Y, move 
objects to make It. 

The same as 'PUTOM, except that Y must be 
a box, and different methods are used 
to find space. 

Try to put X on the table, and If that 
falls put It on anything else. 

Get rid of everything which Is on X. 



USTACKUP (X Y...)) Stack X, Y, ... on top of each other. 

In order of si ze. 

(#FINDSPACE A (X Y Z) B $_C) 

This goal can be achieved by two different 
theorems. TC-FINDSPACE tries to find a 
space of size (X Y Z) on top of object 
A, counting any space occupied by B as 
empty. $_C Is a variable binding used 
to return the answer. If this falls, 
TC-MAKESPACE can create the space by 
moving objects. 



Figure 56 — BLOCKS Commands 
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Uul Memory 

In order to answer questions about past events, the BLOCKS 
programs remember selected parts of their subgoal tree. They do 
this by creatine objects called events, and putting them on en 
EVENTLIST. The system does not remember the detailed series of 
specific steps like *MOVEHAND, but keeps track of the larger 
goals like #PUTON and #STACKUP. The time of events Is measured 
by a clock which starts at and Is Incremented by 1 every time 
any motion occurs. The theorems which want to be remembered use 
the functions MEMORY and MEMOREND, calling MEMORY when the 
theorem Is entered and MEMOREND when It exits, MEMOREND causes 
an event to be created, combining the original goal statement 
with an arbitrary name (chosen from El, E2,...). Recall from 
Section J.l that a relation can Include Its own name, so that 
other relations can refer to It. If we call TOPUTOH with the 
goal (*PUTON $?X J?Y), with the variables X and Y bound to :B1 
and :B2 respectively, the resulting event which Is put Into the 
data base Is <*PUTON EX ;B1 :B23. The event name Is second. 
Instead of last as described In 3.1 for unimportant technical 
reasons which will be changed In later versions. 

In addition to putting this assertion In the data base, 
MEMOREND puts Information on the property list of the event name 
-- the starting time, ending time, and reason for each event. 
The reason Is the name of the event nearest up In the subgoal 
tree which Is being remembered. The reason for goals called by 



Section 3.b. 5 - Page 277 



the linguistic part of the system Is a special symbol meaning 
"because you asked me to". MEMORY Is called at the beginning of 
a theorem to establish the start time and declare that theorem 
as the "reason 11 for the subgoal s It cal 1 s. 

A second kind of memory keeps track of the actual physical 
motions of objects, noting each time one Is moved, and recording 
Its name and the location It went to. This list can be used to 
establish where any object was at any past time. 

When we want to pick up block :B2, we can say: 

<THGOAL(*PICKUP :B1)), and It Is Interpreted as a command. How 

can we ask "Did you pick up :B1? M ? When the robot picked It up, 

an assertion like (#PICKUP E2 :81) was stored In the data base. 

Therefore If we ask PLANNER 

(THPROG(X) 

(THGOAL ('PICKUP $TX :B1>)) 

it will find the assertion, binding the variable X to the event 

name E2. Since the property list of E2 gives its starting and 

ending times, and Its reason, this Is sufficient Information to 

answer most questions. 

If we want to ask something like "Old you pick up :B1 

before you built the stack?" we need some way to look for 

particular time Intervals. This Is done by using a modified 

version of the event description. Including a time Indicator. 

The exact form of the time Indicator Is described In the section 

on semantics, but the way It Is used to establish a goal Is: 

(THGOALUPICKUP J7X : Bl J7TIME) (THUSE TCTE-PI CKUP) ) 
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The prefix TCTE- on the name of a theorem means that It 
includes a time and an event name. Ordinarily when such a 
theorem Is entered, the variable TIME would have a value, while 
the variable X would not. The theorem looks through the data 
base for stored events of the form UPICKUP S?X :B1) and checks 
them to see If they ap.ree with the time TIME. 

For some events, like #PUT0N, this Is sufficient since the 
system remembers esrery #PUTON It does. For others, like rfPICKUP 
less Information Is kept. When #PICKUP is called as a goal at 
the top level. It Is remembered. But the system does not 
remember each time something was picked up In the course of 
moving the toys around. The fact that a block was picked up can 
be deduced from the fact that It was put somewhere, and the 
theorem TCTE-PICKUP actually looks at a number of different 
types of events (like IPUTOH and #PUTIN> to find all the 
occasions on which an object was really picked up. 

For spatial relations, we also need to be able to Include 
time, for example, "Was the block behind the pyramid before.,.? 11 
In this case, no assertions ^re stored, since the memory of 
motion events Is sufficient to reconstruct the scene. There are 
special theorems with the prefix TCT- which try to verify a 
relation with a time condition. For example, we can ask "Is :B1 
on :B2? M with the goal 

(THGOALUON :B1 :82)<THUSE TC-ON)) 
To ask "Was : Bl on :B2 before...?" we bind the variable TIME to 
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the representation of the time we are Interested In, and ask 

<THG0AL(#0N :BI :B2 J?TIME) (THUSE TCT-ON)) 
The theorem TCT-ON Is the same as TC-ON except that It deals 
with the specified time Instead of the present. Similar TCT- 
theorems exist for all of the spatial relations, and for 
properties which change In time, such as *CLEART0P and *AT* 

Appendix C Is a listing of parts of the BLOCKS program as 
It was used In generating the sample dialog. PLANNER theorems 
are sufficiently self-explanatory that only a few comments have 
been Included. The auxllllary LISP functions (such as the one 
which decides whether a space Is empty) are not Included, but 
are available on request, along with the complete BLOCKS 
I Istlngs. 



Section tt. 1.1 - Page 280 



Chapter U . Semantics 



i».l What Is Semantics? 

"i-l-l Ill£ Province flf Semantics 

The field of semantics has always been a hazy swampland. 
There Is little agreement among "semantic! sts" where Its borders 
lie or what the terrain looks like. Logicians, philosophers, 
and linguists all approach It with the tools of their own trade, 
and the problem of just defining "semantics 11 and "meaning 11 have 
occupied volumes of debate. 

In trying to program computers to understand natural 
language. It has been necessary to have a more explicit and 
complete notion of semantics. The attempts at writing language 
understanding programs have made It more clear just what a 
semantic theory has to do, and how It must connect with the 
syntactic and logical aspects of language. In practical terms, 
we need a transducer which can work with a syntactic analysis, 
and produce data which Is acceptable to a logical deductive 
system. 

In the preceding chapters we have described the two ends of 
a language system -- a syntactic parser with a grammar of 
English, and a deductive system with a base of knowledge about a 
particular subject. What does our semantic theory have to do to 
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fill the gap? 

In section 3.1.2 we outlined the basis for a theory of 

semantics. It Includes a world of "concepts' 1 and structures of 

concepts which are postulated by the linguist In trying to 

explain linguistic phenomena. These are not a psychological 

reality, but a formalism In which he can systematically express 

those aspects of meaning which are relevant to language use. By 

manipulating structures In this formalism as a part of analyzing 

sentences In natural language, the theory can directly deal with 

problems of relating Waning to parts of the speaker's and 

hearer's knowledge which are not mentioned explicitly In Che 

sentence being analyzed. 

A semantic theory must describe the relationship between 
the words and syntactic structures of natural language and the 
postulated formalism of concepts and operations on concepts. In 
our theory, this relationship Is described as a set of 
procedures which analyze linguistic forms to produce 
representations of meaning In the Internal conceptual formalism. 
Just as with the grammar, this does not purport to be a model of 
an actual process taking place In the hearer or speaker. The 
process description Is used because It Is a powerful way to 
describe "neutral" relationships, as well as being 
psychologically suggestive. 

The theory must describe relationships at three different 

levels. First, there must be a way to define the meanings of 

words. We pointed out In the section on "meaning" (section 

3.1) that the real "meaning" of a word or concept cannot be 

defined In simple dictionary terms, but Involves Its 

relationship to an entire vocabulary and structure of concepts. 

However, we can talk about the formal description attached to a 
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word which allows It to be Integrated Into the system. In the 
rest of this chapter, we will use the word "meaning" In thfs 
more limited sense, describing those formal aspects of the 
meaning of a word (or syntactic construction) which are attached 
to it as its dictionary definition. 

The formalism for definitions should not depend on the 
details of the semantic programs, but should allow users to add 
to the vocabulary In a simple and natural way. It should also 
be possible to handle the quirks and Idiosyncracles of meaning 
which words can have. Instead of limiting ourselves to "well- 
behaved 11 standard words. 

At the next level we must relate the meanings of the words 
in a sentence to each other and to the meaning of the syntactic 
structures. Ue need an analysis of the ways in which English 
structures are designed to convey meaning, and what role the 
different words and syntactic features play In this meaning. 

Finally, a sentence In natural language is never 
Interpreted In Isolation. It Is always part of a context, and 
Its meaning Is dependent on that context. A theory should 
explain the different ways In which the "setting 11 of a sentence 
can affect Its meaning. it must deal both with the linguistic 
setting (the context within the discourse) and the real-world 
setting (the way meaning Interacts with knowledge of non- 
1 lngulstlc facts.) 
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^1-2 Ihfi Semantic System 

With definite goals In mind for a semantic system, we can 
consider how to Implement It. First let us look at what It 
should know about English. As we have been emphasizing 
throughout the paper, a language Is not a set of abstract 
symbols. It Is a system for conveying meaning, and has evolved 
with very special mechanisms for conveying just those aspects of 
meaning needed for human comnunlcat Ion, 

Section 3.1 discussed the person's "model of the world' 1 
which Is organized around notions of "objects", having 
"properties" and entering Into "relationships." In 3.1,3, these 
are combined to form more complicated logical expressions. 
Looking at the properties of English syntax (as described In 
Section 2.3) we see that these basic elements of the "world 
model" are just what English Is good at conveying. 

For describing objects, there Is the NOUN GROUP, It 
contains a noun, which Indicates the kind of object; adjectives 
and classifiers, which describe further properties of the 
object; and a complex system of quantifiers and determiners 
describing Its logical status — whether It Is a particular 
object, ("the sun"), a class of objects ("people"), a 
particular set of objects ("John's lizards"), an unspecified set 
containing a specified number of objects ("three bananas"), etc. 
The details (described In section 4.2) are complex, but the 
Important thing Is the existence of a systematic structure. 
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For describing relationships and events, there are the 
CLAUSE, PREPOSITION GROUP, and ADJECTIVE GROUP. The CLAUSE Is 
especially suited for dealing wEth relationships having a 
particular time reference, working In coordination with the VERB 
GROUP, which functions to convey Information about tine, using 
an Ingenious system of tenses. Clauses can also be used to 
represent an event or relationship as an object (as In " HI s 
going pleased me. 1 '), or to modify a particular object within a 
NOUN GROUP {In f, the man #hQ broke the bank "). The PREPG Is a 
less flexible and simpler way of expressing relationships which 
do not need modifiers such as time, place, and manner (such as 
n the man J_q the blue vesfr H ) » The AOJG Is used In some 
constructions to describe properties and some special kinds of 
relationships of objects (such as "Her gift was bigger than a 
breadbox.") 

The semantic system Is built around a group of about a 
dozen programs which are experts at looking at these particular 
syntactic structures. They look at both the structures and the 
meanings of the words to build up PLANNER expressions which will 
be used by the deductive mechanism. It Is Important to remember 
that the parser uses systemic grammar so the semantic programs 
can look directly for features such as PASSIVE or PLURAL or 
QUESTION to make decisions about the meaning of the sentence or 
phrase. 

Since each of these semantic "special Ists" can work 
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separately, there Is no need to wait for a complete parsing 
before beginning semantic analysis. The NOUN GROUP specialist 
can be called as soon as a NOUN GROUP has been parsed, to see 
whether It makes sense before the parser goes on. In fact, the 
task can be broken up, and a preliminary NOUN GROUP specialist 
can be called In the middle of parsing (for example, after 
finding the noun and adjectives, but before looking for 
modifying clauses or prepositional phrases) to see whether It Is 
worth continuing, or whether the supposed combination of 
adjectives and noun Is nonsensical. The grammar Is In the form 

of a program, so It Is just as easy to call a semantic routine 
at any time as a syntactic one. Any semantic program has full 
power to use the deductive system, and can even call the grammar 
to do a special bit of parsing before going on with the semantic 
analysis. For this reason It Is very hard to classify the 
semantic analysis as "top-down 11 or "bottom-up". In general each 
piece of the structure Is analyzed as It Is parsed, which Is a 
bottom-up approach. However whenever there Is a reason to delay 
a part of the analysis until some of the larger structure has 
been analyzed. It Is just as easy to write the semantic 
specialist programs In this too-down manner. In our system both 
approaches are used. 
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fr-1.3 Words 

A semantic system needs to deal with two different kinds of 
words. Some words are Included In the general knowledge of the 
English language. Words like "that" or "than 11 . In "He knew that 
they were madder than hornets," would be difficult to define 
except In terms of their place In the sentence structure. They 
are being used as signals of certain syntactic structures and 
features, and have no meaning except for this signalling (which 
Is recognized by the grammar). These are often called "function 
words" In distinction to the "content words" which make up the 
bulk of our vocabulary. This Is not a sharp distinction, since 
many words serve a combination of purposes (for example, numbers 
are basically "function words", but each one has Its unique 
meaning). We can generally distinguish between words like 
"that" and "than" whose meanings ere built Into the system, and 
words like "snake", "under", and "walk", which surely are not. 

The definitions of content words should not have to Include 
"expert 11 knowledge about the semantics or grammar of the 
language. In defining the word "mighty", we should not have to 
worry about whether It appears In "The sword Is mighty," or 

"the mightiest warrior", or "a man mightier than a locomotive." 
We should be able to say "'Mighty 1 means having the property 
represented conceptually as JMIGHT.", and let the semantic 

system do the rest. 

We need a semantic language for expressing def inl tlons In a 
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way which does not depend on the grammar or the particular 
semantic programs. Each of our "specialists" which looks at the 
meanings of words should be able to Interpret those statements 
In the semantic language which might be relevant to Its job. 

Section b.2 describes simple formats for defining common 
verbs, nouns, adjectives, and prepositions, and In fact, 
these definitions do not look much like programs at all. Why 
then do we call this a "language" Instead of saying that we have 
a set of special formats for defining words? The distinction 
becomes Important for all of the Irregular cases and the 
Idlosyncracles that words can have. For example. In "The block 
Is on the roof of the car.", "the roof of the car" Is a NG 
referring to a particular object which Is a roof. But If we say 
"The block Is on the right of the box", we are not referring to 
a particular object which Is a "right". The normal NG mechanism 
for describing objects Is being used Instead to describe a 
relationship between the block and the box. We could reprogram 
our NOUN GROUP semantic specialist to recognize this special 
case and treat It differently, but this Is a path leading to a 
roadblock. We will not be able to anticipate every case, and as 
the program becomes more and more patched. It will become harder 
to change and less likely to work. 

What we need Is a flexible way of defining words, so that 
the word "right" Itself can cause the right things to happen In 
semantic Interpretation, without changing the system. This Is 
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achieved by letting the definition of each word be a LISP 
program to be run at an appropriate time In the semantic 
analysis. For simple cases, there are standard functions with a 
special format for usual types of definitions. In the complex 
cases there Is a platform from which to operate, doing whatever 
calculations and changes of the environment are needed. This 
flexibility Is Important In many places. For example, the word 
"one" when used as a noun (as In "the green one") has a special 
use for referring back to previously mentioned nouns. It could 
not be defined by a simple format, as could "block" or "dog", 
since It Involves complex decisions about what Is really being 
referred to, and needs access to the previous discourse. In our 
system, its definition as a noun Is compatible with the 
definitions of all other nouns -- the semantic specialists don't 
know anything about tt. When the NG specialist Is ready to use 
the definition of the noun. It calls 1 t as a program. In the 
usual case, this program sets up a standard data structure. In 
the case of "one". It calls a heuristic program for 
understanding back-references, and Its effect on the meaning 
will depend on the discourse. Similarly, the verb "be" is 
called like any other verb by the semantic specialist, but In 
fact Its definition Is a complex program describing Its 

different uses. 

The use of procedures to represent meanings of words gives 
a flexibility which allows these exceptional words to be handled 
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as well as the more ordinary forms. At the same time. It 
provides a strict test of representations of procedures for 
particular words, since the procedures can actually be run In 
the system. 
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ii.l.i* Amblcultv 

A semantic theory must have some way to account for 
multiple meanings of words, phrases, and sentences. We would 
like to explain not only how multiple interpretations can occur, 
but also how the hearer sorts them out to pick a single meaning. 

As a start, we must allow words to have several "senses", 
and must be able to have multiple Interpretations of phrases and 
sentences to correspond to them. Next we must realize that the 
syntactic structures can also lead to semantic ambiguities. 
Sentences like the famous "Time flies like an arrow." derive 
some of their ambiguity from their ability to be analyzed 
syntactically In more than one way. Finally, we Include some 
ambiguities as a result of the semantic analysis. The sentence 
"A man sitting In this room fired the fatal shot." will be 
ambiguous even If we agree on a single meaning for each word, 
and a surface structure for the sentence. If spoken by Perry 
Mason at a dramatic moment in the courtroom. It means "a man who 
1§. sitting In this room", but If spoken by the detectives when 
they broke Into the empty hotel room across the street from the 
scene of the crime. It means "who wfis. sitting In this room". 
This could be treated as a syntactic ambiguity In the deep 
structure, but In our analysis It Is Instead treated as a 
semantic ambiguity Involving the time reference. 

In describing Che grammar It was pointed out that we do not 
carry forward simultaneous parsings of a sentence. We try to 
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find the "best" parsing, and try other paths only If we run Into 
trouble. In semantics we take the other aoroach. If a word has 
two meanings, then two semantic descriptions are built 
simultaneously, and used to form two separate phrase 
I nterpretat Ions. 

We can Immediately see a problem here. There Is dire 
danger of a combinatorial explosion. If words A* B, C, and D 
each have thrpe meanings, then a sentence containing all of them 
may have 3x3x3x5, or 81 Interpretations. The possibilities for 
a long sentence are astronomical. 

Of course a person does not build up such a tremendous 
list. As he hears a sentence, he "filters out" all but the 
most reasonable Interpretations. We know that a "ball" can be 
either a spherical toy or a dancing party, and that "green" can 
mean either the color green, or unripe, or Inexperienced. But 
when we see "the green ball", we do not get befuddled with six 
Interpretations, we know that only one makes sense. The use of 
"green" for "unripe" applies only to fruit, the use as 
"Inexperienced" applies only to people, and the color only to 
physical objects. The meaning of "ball" as a party fits none of 
these categories, and the meaning as a "spherical toy" fits only 
the last one. We can subdivide the world into rough classes 
such as "animate", "Inanimate", "physical", "abstract", "event", 
"human", etc. and can use this classification scheme to filter 
out meaningless combinations of Interpretations. 
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Some semantic theories <Foder> are based almost completely 
on this Idea. We would like to use It for what It Is -- not a 
complete representation of meaning, but a rough classification 
which eliminates fruitless semantic Interpretations. Our system 
has the ability to use these "semantic markers" to cut down the 
number of semantic Interpretations of any phrase or sentence. 

A second method used to reduce the number of different 
semantic Interpretations Is to do the Interpretation 
continuously. We do not pile up all possible Interpretations 
of each piece of the sentence, then try to make logical sense of 
them together at the end. As each phrase Is completed. It Is 
understood. If we come across a phrase like "the colorful ball" 
In context, we do not keep the two different possible 
Interpretations In mind until the utterance Is finished. We 
Immediately look In our memory to see which Interpretation Is 
meaningful In the current context of discourse, and use only 
that meaning In the larger semantic analysis of the sentence. 
Since our system allows the grammar, semantics and deduction to 
be easily Intermixed, It Is possible to do this kind of 

continuous Interpretation. 

Finally we must deal with cases where we cannot eliminate 
all but one meaning as "senseless". There will be sentences 
where more than one meaning makes sense, and there must be some 
way to choose the correct one In a given context. In the 
section on context below, we discuss the use of the overall 



Section 4.I.I1 - Page 293 



discourse context In assigning a plausibility factor to a 
particular Interpretation. By combining the plausibilities of 
the various parts of a sentence, we can derive an overall factor 
to help choose the best. 

There will always be cases where no set of heuristics will 
be enough. There will be multiple Interpretations whose 
plausibilities will be so close that It would be simply guessing 
to choose one. In our sample dialogue, there Is an example with 
the word "on". "The block Is on top of the pyramid." could mean 
either "directly on the surface" or "somewhere above". There Is 
no way for the hearer (or computer) to read minds. The obvious 
alternative Is to ask the speaker to explain more clearly what 
Is meant. As a final resort, the system can ask questions like 
"By the word "on" In the phrase "on top of green blocks" did you 
mean 'directly on the surface' or 'somewhere above 1 ?". The 
methods used for handling ambiguity are described In more detail 
In section b.Z.10 
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!UU5 p^coursc 

At the beginning of our discussion of semantics, we 

discussed why a semantic system should deal with the effect of 
"setting" on the meaning of a sentence. A semantic theory can 
account for three different types of context. 

First, there Is the 1oc*i discourse context, which covers 
the discourse Immediately preceding the sentence, and Is 
Important to semantic mechanisms like pronoun reference. If we 
ask the question "Did you put it on a green pjTfi?" or "to?" or 
"How many of ihem were there Hifin?", we assume that It will be 
possible to fill In the missing Information from the Immediate 
discourse. There are a number of special mechanisms for using 
this kind of Information, and they form part of a semantic 

theory. 

Second, there Is an overall dlscqurse. context. A hearer 
will Interpret the sentence "The group didn't have an Identity." 
differently depending on whether he Is discussing mathematics or 
sociology. There must be a systematic way to account for this 
effect of general subject matter on understanding. In addition 
to the effects of general subject on choosing between meanings 
of a word, there Is an effect of the context of particular 
things being discussed. If we are talking about Argentina, and 
say "The government Is corrupt.", then It Is clear that we mean 
"the government of Argentina". If we say "Pick up the 
pyramid.", and there are three ovramlds on the table, It will 
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not be clear which one Is meant. But If this Immediately 
follows the statement "There Is a block and a pyramid In the 
box/ 1 , then the reference Is to the pyramid In the box. This 
would have been clear even If there had been several sentences 
between these two. Therefore this Is a different problem than 
the local discourse of pronoun reference. A semantic theory 
must deal with all of these different forms of overall discourse 
context. 

Finally, there Is a context of knowledge about the world, 
and the way that knowledge effects our understanding of 
language. If we say "The city councilmen refused the 

demonstrators a perrrl t because they feared violence/ 1 , the 
pronoun "they 11 will have a different Interpretation than If we 
said "The city councilmen refused the demonstrators a permit 
because they advocated revolution/ 1 We understand this because 
of our sophisticated knowledge of councilmen, demonstrators, and 
politics -- no set of syntactic or semantic rules could 
interpret this pronoun reference without using knowledge of the 
world. Of course a semantic theory does not Include a theory 
of political power groups, but It must explain the ways In which 
this kind of knowledge can Interact with linguistic knowledge In 
Interpreting a sentence. 

Knowledge of the world may affect not only such things as 
the Interpretation of pronouns, but may alter the parsing of the 
syntactic structures as well. If we see the sentence "He hit 
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the cer with a rock." the structure will be parsed differently 
from "He hit the car with a dented fender.", since we know that 
cars have f enders, but not rocks. 

In our system, most of this discourse knowledge Is called 
on by the semantic specialists, and by particular words such as 
"one", "It", "then", "there", etc. We have concentrated 
particularly on local discourse context, and the ways In which 
English carries Information from one sentence to the next. A 
number of special pieces of Information are kept, such as the 
time, place, and objects mentioned In the previous sentence. 
This Information Is referenced by special structures and words 
like pronouns, "then", and "there". The meanlnc of the entire 
previous sentence can be referred to In order to answer a 
question like "Why did you do that ?" or just "Why?". 

There ^re two facilities for handling overall discourse 
context. The first Is a mechanism for assigning a 
"plausabl 1 1 ty factor" to an Interpretation of a word. For 
example, the definition of the word "bank" might Include the 
fact that If we ^re discussing money. It Is most likely to mean 
a financial Institution, while If we are discussing rivers. It 
probably means the edge of the land. Our system allows the 
definition of a word to Include a program to compute a 
"plausabl 1 1 ty factor" (an arbitrary additive constant) for each 
Interpretation, This computation might Involve looking at the 
rest of the sentence for key words, or might use some more 
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general Idea, like keeping track of the general area of 
discussion (perhaps In some sort of network or block structure) 
and letting the plausibility of a particular meaning depend on 
its "distance" from the current topic. This has not been 
Implemented since we have Included only a single topic of 
discourse In the vocabulary. It Is discussed further In section 
5.2. 

The second type of overall discourse context Involves the 
objects which have been previously mentioned. Whenever an 
object or one of Its properties Is mentioned, either by the 
human or the computer, a note Is made of the time. Later, If we 
use a phrase like "the pyramid", and the meaning Is not clear, 
the system can look for the one most recently mentioned. 

Finally* the knowledge of the world can enter Into the 
semantic Interpretation. We have mentioned that the grarmar can 
ask the semantic Interpreter "Does this NOUN GROUP make sense?" 
before continuing the parsing. The semantics programs can In 
turn call on PLANNER to make any deductions needed to decide on 
Its sensibility* Thus Information about the world can guide the 
parsing dl rectly. 
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fr-1-6 GqjIs of a Semantic Theory 

We have set ourselves very brood goals In our definition of 
semantics, asking for everything which needs to be done, rather 
than limiting ourselves to those aspects which can be explained 
and characterized In a neat formalism. How does this compare 

with the more limited goals of a semantic theory like that of 

Fodor and Katz <Fodor>, which looks only at those aspects of 

meaning which are Independent of the "setting 11 of a sentence? 

We have seen that their theory of "semantic markers" is In 

fact a part of the "filtering" needed for "exploiting semantic 

relations In the sentence to eliminate potential ambiguities" 

(<Fodor>p. t»b5)", and that the "semantic dlst Inguishers" are a 

rudimentary form of the logical descriptions which we build up 

to describe objects and events. They state that "the 

distinction between markers and dl st Inguishers Is meant to 

coincide with the distinction between that part of the meaning 

of a lexical Item which Is systematic for the language and that 

part of the meaning of the I tern which Is not." (<Fodor> p. li98). 

We believe that much more of meaning Is systematic, and that a 

semantic theory can be of a much wider scope. 

What about the more restricted goals a semantic theory 
might achieve such as "accounting for.., the number and content 
of the readings of a sentence, detecting semantic anomalies, and 
deciding upon paraphrase relations between sentences."? In a 
more complete semantic theory, these are not primary goals, but 
by-products of the analysis. A phrase Is a semantic anomaly If 
the system produces no possible Interpretations for It. Two 
sentences are paraphrases If they produce the same 
representation In the Internal forrr-allsm for meaning, and the 
"number and content 11 of the readings of a sentence are the 
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Immediate result of Its semantic analysis. Which of these will 
happen depends on the entire range of ways In which language 
communicates meaning, not on a restricted subset such as the 
logical relations of markers. Once we have a conceptual 
representation for meaning, problems such as these are secondary 
byproducts of the basic analysis which relates a sentence to the 
representation of Its meaning. 

In addition, we can talk about sentences being anomalies or 

paraphrases "In context", as well as "without regard to 

context", since we want the theory to Include a systematic 

analysis of those features of context which are relevant to 

understanding. 
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b.2 Semantic Structures 
The previous section outlined the structure of a semantic 
interpreter, and described the use of semantic "specialists 11 In 
analyzing different aspects of linguistic structure* Each 
specialist Is In the form of a procedure which describes the 
particular aspect of semantics which with It Is concerned. We 
can look at Its function as creating a part of a complete 
description of the meaning of the sentence by building complex 
list structures which we will call "semantic structures" to 
describe objects and relationships. Events are a type of 
relationship (Involving time), and the class of "object" 
Includes anything which could be treated as an object In English 
grammar, even If It Is as abstract as "truth 11 . There are two 
bas lc types of structures used — one to describe objects, (an 
Object Semantic Structure, or OSS) and the other to describe 
relationships (an RSS). In general, noun groups are Interpreted 
to form object structures, while the other groups and clauses 
are Interpreted to form relationship structures. Words al ready 
have a semantic structure of their own (their definition) and 
are used In building up the structures for the larger units 
which contain them. 
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JLOU OfrJCCt Semantic Structures 

Let us first look at the semantic structures used to 

describe objects. First, we need the actual PLANNER statements 

which will be used In deducing things about the objects. An NG 

like "a red cube" can be described using the formalism of 
Chapter J: 



(THPROG (XI) 

(THGOALCIS S?X1 'BLOCK)) 
UEQDIM J?X1) 
<THGOAL(«C0LOR $?X1 #RED))) 

Figure 57 — Simple PLANNER Description 



The variable "XI" represents the object, and this 
description says chat It should be a block. It should have equa 
dimensions, and It should be red. (See section 3.U for the 
details of representation). A phrase such as "a red cube which 
supports three pyramids but Is not contained In a box" has a 
more complex description. This would be built up from the 
descriptions for the various objects, and would end up: 



(THPROG(Xl) 

(THGOAICMS t«U 'BLOCK)) 

CfEQDIM $?X1) 

(THGOALCCOLOR $?X1 #RED)) 

(THFIND 3 *?X2 <X2) (THG0AL(#IS $?X2 #PYRAMID>) 

(THGOALCSUPPORT S?X1 S?X2))) 
(THN0T<THPR0G(X3) 

(THGOALCIS JTX3 #B0X>) 

(THGOAL(#C0NTAIN $?X3 $?X1)>))> 

Figure 58 — PLANNER Description 



We can learn how the semantic specialists work by watching 
them build the pieces of this structure. First take the simpler 
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NG, M a red cube 11 . The first NG specialist doesn't start work 
until after the noun has been parsed. The PLANNER description 
Is then built backwards, starting with the noun, and continuing 
in r I gh t- to-1 p* 1 i order through the classifiers and adjectives. 
The beginning of the NG, with the determiner, number, and 
ordinal Is handled by a part of the NG specialist described 
later. The first NG specialist Is named SMHG1 -- all of the 
names begin with SM (for "semantic"), followed by the name of 
the unit they work with, followed by a number Indicating the 
order In which they are called. 3MNG1 sets up an environment 
(we will describe various parts of It as we go), then calls the 
definition of the noun. (Remember that definitions are In the 
form of programs). For simple nouns there Is a standard 
function to define then easily, What should the definition 
Include? First, a way to Indicate the PLANNER statements which 
are the heart of Its meaning. The symbol "***" Is used to 
represent the object, so our definition of "cube 11 contains the 
expression: 

((#IS *** #BL0CK)(*EQDIM ***>) 
The syntax of PLANNER functions such as THPROG and THGOAL will 
be added by the specialists, since we want to keep the 
definition as simple as possible. 

There Is one other part of the definition for a noun « the 
semantic markers, used to filter out meaningless 
Interpretations of a phrase. The definition needs to attach 
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these semantic markers to each OSS. The BLOCKS world uses the 
tree of semantic markers In Figure 59. 

This Is the same type of diagram used for grammars. In 
which vertical bars represent choices of mutually exclusive 
markers, while horizontal lines represent logical dependency. 
The symbol "fPHYSOB" means "physical object", and "tMANIP" means 
"manlpulable object". The word "cube" refers to an object with 
the markers (#THIM "PHYSOB #MANIP #BL0CK). We shouldn't need 
to mention all of these In the definition, since the presence of 
•BLOCK Implies the others through the logical structure of the 
marker trre. 

The definition of the noun "cube" Is then: 
(NMEANS((#BLOCK)((#IS "* ("BLOCK) ( #EQD IM ••*)))> 

NMEAKS Is the name of the function for dealing with nouns, 

and It accepts a list of different meanings for a word. In this 

case, there Is only one meaning. The first part of the 

definition Is the marker list, followed by the reduced PLANNER 

definition. Uhen NHEANS Is executed. It puts this Information 

onto the semantic structure which Is being built for the object. 

It takes care of finding out what markers are Implied by the 

tree, and deciding which predicates need to be In a THGOAL 

statement (like MS), and which are LISP predicates (like 

'EQDIMJ. We will see later how It also can decide what 

recommendation lists to put onto the PLANNER goals, to guide the 
deduction. 
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THING*- 



I NAME 

(PLACE 

• PROPERTY— i 



(SHAPE 

(SIZE 
(LOCATION 

(COLOR 



IAN I MATE- 



I (ROBOT 
•HUMAN 



^ SPECTRUM 



(PHVSOB- 



c 



(BLUE 

'RED 

(BLACK 

(WHITE 

(GREEN 

(CONSTRUCT 
(HAND 




*TABLE l'PYRAl 
(MAN IP— -I'BLOC 
(BOX MEALL 



(PYRAMID 
OCK 



(RELATION 



H (EVENT 
(T1MEL 



IE LESS 



Figure 59 -- Semantic Markers for the BLOCKS Vocabulary 
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SMHGl then calls the definition for the adjective "red". 

we would like this definition to Include the PLANNER assertion 

('COLOR **« #RED), and Indicate that It applies only to physical 

objects. We can use the same format used for nouns, defining 
"red" as: 

(NMEAHS((#PHYSOB)((#COLOR **• *REO)))) 

Notice that there Is no distinction made between the use of 
'PHYSOB here to Imply "applies only to physical objects" and the 
use of >BLOCK In the definition of "cube" to say "this Is a 
block". This Is because of the way the markers are 
Implemented. The marker list In a definition Is Interpreted to 
mean "this definition applies only If none of the markers here 
are In conflict with any of the markers already established for 
the object". Since the noun Is the first thing Interpreted, Its 
markers cannot possibly conflict., and are simply entered as the 
Initial marker list for the object. The marker programs are 
designed so that we do not need to limit ourselves to a single 
tree - we could classify objects along several dimensions, and 
set up separate marker trees for each. Pnr example, we might 

classify objects both by their physical properties and by their 
use. 

The order of analysis of modifiers Is aulte natural to the 
use of "relative" modifiers. It Is Impossible to give an 
absolute definition for "big" or "little", since a "big flea" Is 
still not much competition for a "little elephant". The meaning 
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of the adjective Is relative to the noun It modifies. In fact. 
It may also be relative to the adjectives following It as well. 
A "big toy elephant" Is on a scale of Its own. Since our system 
analyzes the HG from right to left, the meaning of each 
adjective Is added to the description already built up for the 
head and modifiers to the right. Since each definition Is a 
program. It can just as well be a program which examines the 
description (both the semantic markers and the PLANNER 
description), and produces an appropriate meaning relative to 
the object being described. This might be In the form of an 
absolute measurement (e.g. a "big elephant" Is more than 17. feet 
tall) or can remain In a relative form by producing a PLANNER 
expression of the form "the number of objects fitting the 
description and smaller than the one being described Is more 
than the number of suitable objects hlgger than It Is". 

In adding the meaning of "red" to the semantic structure, 
the specialist must make a choice In ordering the PLANNER 
expressions. We remember from section 2.1 that the order of 
expressions can be Important, since variable assignments are 
done in the order encountered. If we have the first sequence 
shown In Figure 60, PLANNER will look through all of the blocks, 
checking until It finds one which is red. However If we have 
the second. It will look through all of the red objects until It 
finds one which is a block. In the robot's tiny world, this 
Isn't of much Importance, but If we had a data base which could 
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take phrases like "a nan In this room", we would certainly be 
better off looking around the room first to see what was a man, 
than looking through all the men In the world to see If one was 

In the room. 

) 

(THPROG(X) 

(THGOALClS $?X #BL0CK)) 
(THGOALUCOLOR $?X #HED>)) 

(THPROG(X) 

(THGOALC#COLOR S?X *RED)> 
(THGOALCIS S?X #BL0CK))) 

Figure 60 -- Ordering Goals 



To make this choice we allow each predicate (like #IS or 
•COLOR) to have associated with It a program which knows how to 
evaluate Its "priority" In any given environment. The program 
might be as simple as a single number, which would mean "this 
relation always has this priority". It might on the other hand 
be a complex heuristic program which takes Into account the 
current state of the world and the discussion. In our 
definitions, we have adopted the simpler alternative, assigning 
fixed priorities In the range to 1000 arbitrarily. By keeping 
track of the priority of the expression currently at the top of 
the PLANNER description, the function NMEANS can decide whether 
to add a new expression above or below It. 

Let us now look at the actual structure which would be 
built up by the program: 
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( ( ((XI) 200 (THGOAUfIS $?X1 'BLOCK)) 

(THG0AL( 'COLOR $?X1 *RED)) PLANNER 

UEQDIM $?X1)> description 

(0 #BLOCK WAN IP #PHYSOB #THING) markers 

(WANIP fPHYSOB *THING) systems 

xl variable 

(NS INDEF NIL) determiner 

NIL) 



ordinal 



Figure 61 -- OSS for "a red cube" 



Most of the oarts of this structure (called an flbiect; 
Semantic Structure or OSS) have already been explained. The 
PLANNER description Includes a variable list (we will see Its 
use later), the priority of the first expression, and a list of 
PLANNER expressions describing the object. The "markers" 
position lists all of the semantic markers applicable to the 
object. The at the beginning of the list is the 
"plausablllty" of this Interpretation. This factor was 
discussed In section h.l.it, and Is set when we are faced with 
more than one possible Interpretation of a word. Each semantic 
structure carries along with it an accumulated plausablllty 
rating. This will remain unless It Is set specifically by an 

amblgul ty. 

The "systems" position Is a list of all of the nodes In 
the set of marker trees (remember that there can be more than 
one) which have already had a branch selected. it Is used in 
looking for marker conflicts. The "variable" Is the variable 
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name chosen to represent this object. The system generates It 
from the set XI, X2, X3..., providing a new one for each new 
structure. The only two positions left are the determiner and 
the ordinal. These are explained In section d.2,ii 
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0*1 Be'atlve Clause* 

Let us now take a slightly more complicated NG, "a red cube 
which supports a pyramid," and follow the parsing and semantic 
analysis. First, the HG parsing program finds the determiner 
("a"), adjective ("red"), and noun ("cube"). At this point 
SMNGl Is called and creates the structure described In the 
previous section. Notice that the NG Is not finished when SMNGl 
is called — It has only reached a point where we can do a first 
analysis. At this point, the KG might be rejected without 
further parsing If the combination of noun, classifiers, and 
adjectives Is contradictory to the system of semantic markers. 
Next the NG program looks for a qualifier, and calls the 
CLAUSE part of the grammar by (PARSE CLAUSE RSQ) . The feature 
RSO (rank shifted qualifier) Informs the CLAUSE program that It 
should look for a RELWD like "which". It does, and then looks 
for a VG, succeeding with "supports". The VG program calls Us 
own semantic specialist to analyze the time reference of the 
clause, but we will Ignore this for now. Next, since "support" 
is transitive, the CLAUSE looks for an object, and calls the NG 
program. This operates In the same way as before, producing a 
semantic structure to describe "a pyramid". The definition of 

"pyramid" is: 

(NMEANS((#PYRAMID)(('IS ••* 'PYRAMID)))) 

so the resulting structure is: 
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( (<X2> 200 <THG0AL(#IS s?X2 'PYRAMID))) 

(0 'PYRAMID 'MANIP 'PHYSOB THING) 
<*>MANlP VPHYSOR (THING)) 

X2 

(MS INDEF NIL) 

NIL) 

Figure 62 -- OSS for "a pyramid" 



At this point the first CLAUSE specialist Is called to 
analyze the clause "which supports a pyramid". We want to 
define verbs In a simple way, as we do nouns and adjectives, 
saying something like "If the subject and object are both 
physical objects, then "support" means the relation 'SUPPORT 
between them In that order". This Is written formally using the 
function CMEANS, as: 

(CMEANS(((<*PHYS0B))<('PHYS0B)))(*SUPP0RT #1 #2)NIL>) 

All of the extra parentheses are there tn leave room for 
fancier options which will be described later. The Important 
parts are the semantic marker lists for the objects 
participating In the relationship, and the actual PLANNER 
expression naming It. The symbols "#1" and "#2" (and "*3" If 
necessary) are used to Indicate the objects, and the normal 
order Is 1. semantic subject (SMSUB) 2. semantic first object 
(SM0B2) 3. semantic second object (SM0B2). Notice that we 
have prefixed the word "semantic" to each of these. In fact, 
they may very well not be the actual syntactic subject and 
objects of the clause. In this example, the SMSUB Is the NG "a 
red cube" to which the clause Is being related. SMCL1 knows 
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this since the parser has noted the feature SUBJREL. Before 

calling the definition of the verb, SMCL1 has found the OSS 

describing "a red cube" and set It as the value of the variable 

SMSUB. Similarly It has taken the OSS for "a pyramid" and put 

It In SM0B1, since It is the object of the clause. The 

definition of the verb "support" Is now called, and CMEANS uses 

the Information In the definition to build up a Relation 

Semantic Structure (RSS). First It checks to make sure that 

both objects are compatible with their respective marker lists. 

The marker lists are In the same order as the symbols #1, *2, 

and #3. In this case, both the subject and object must be 

physical objects. 

Next SMCL1 substitutes the objects Into the relation. If 

It Inserted the actual semantic structures, the result would be 
hard to read and time-consuming to print. Instead, the NG 
specialists assign a name to each OSS, from the set HQ1, NG2, 
NG3,... We therefore net (#SUPPORT NG1 NG2) as the description 
of the relationship. The final semantic structure for the 
clause (after a second specialist, SMCL2 has had a chance to 
look for modifiers and rearrange the structure Into a convenient 
form) Is: 
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(NG1 USUPPORT NG1 NG2) 

•*e' relation 



NIL) (0}) 
neg markers 



Figure 63 -- Relation Semantic Structure 1 



The position marked "rel" holds the name of the NG 
description to which this clause serves as a modifier. We will 
see later that it can be used In a more general way as well. 
The "relation" Is the material for PLANNER to use, and "neg" 
marks whether the clause Is negative or not. 

The last element Is a set of semantic markers and a 
priority, just as we had with object descriptions. 
Relationships have the full capability to use semantic markers 
just as objects do, and at an early stage of building a relation 
structure. It contains a PLANNER description, markers, and 
systems In the identical form to those for object structures 
(this Is to share some of the proerams, such as those which 
check for conflicts between markers). We can classify different 
types of events and relationships (for example those which are 
changeable, those which Involve physical motion, etc.) and use 
the markers to help filter out interpretations of clause 
modifiers. For example, the modifying PREPG "without the 
shopping list" In "He left the house without the shopping list" 
has a different Interpretation from "without a hammer" In "He 
built the house without a hammer." If we had a classification of 
activities which Included those Involving motion and those usine 
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tools, we could choose the correct Interpretation. A system can 
be constructed which operates much like Fillmore's case system, 
assigning classes of verbs according to the type of modification 
they take, and using this to find the correct relation between a 
verb and Its modifying phrase. This will be discussed more In 
the section on types of PREPG. 

In our limited world, we have not set up a marker tree for 
relationships and events, so we have not Included any markers In 
the definition of "support". The marker list In the RSS 
therefore contains only the plausibility, 0. The "MIL" In the 
definition Indicates that there are no markers, and would be 
replaced by a list of markers If they were used. 

The clause Is now finished, and the specialist on relative 
clauses (SKRSO.) Is called. Its task Is to take the Information 
contained In the PLANNER descriptions of the objects Involved In 
the relation, along with the relation Itself, and to put It all 
onto the PLANNER description of the object to which the clause 
Is belna related. The way In which this Is done depends on the 
exact form of the different objects (particularly on their 
determiners). In this case. It Is relatively easy, and the 
description of "a red cube which supports a pyramid" becomes: 
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( ( ({XI X2) 200 (THGOALUIS mi #RlOCK)) 

(THGOAL(#COLOR S?X1 *RED>) 

C'EQDIM S?X1) 

(THG0AL(#IS $?X2 fPYRAMID}) 
(THGOALUSUPPORT $?X1 l?X2>)) 

(0 #BL0CK f MAN IP ♦PHYSOB 'THING) 

WMANIP #PHYS0ft 'THING)) 
XI 

(NS INDEF NIL) 
NIL) 

Figure 6*4 -- OSS for "a block which supports a pyramid 



_ 



The only thine which has changed Is the PLANNER 
description, which now holds all of the necessary information. 
Its variable list contains both XI and X2, and these variable 
names have been substituted for the symbols NG1 and NG2 In the 
relation, which has been combined with the separate PLANNER 
descriptions for the objects. Section k,2.b describes how a 
relative clause works with other types of NG descriptions. 



- ' 

i: ..a 
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U.2.5 Preposition Groups 

Comparing the phrase "a red cube which supports a pyramid" 
with the phrase "a red cube under a pyramid, we see that 
relative clauses and Qualifying prepositional phrases are very 
similar In structure and meaning. In fact, their semantic 
analysis Is almost Identical. The definition of a preposition 
like "under" uses the same function as the definition of a verb 
like "support", saying "If the semantic subject and object are 
both physical objects, then the object Is 'ABOVE the subject" 
(Remember that In our BLOCKS world we chose to represent all 
vertical space relations using the concept 'ABOVE). This can be 

formal Ized as: 

(CMEANS(<<(#PHYSOB)>U*PHYSDB>))(#ABOVE #2 *1)NIL) 

Again, the symbols #1 and *2 refer to the semantic subject 

and semantic first object, but In the case of a preposition 

group used as a qualifier, the SMSUB Is the NO of which the 

PREPG Is a part, while the SM0B1 Is the object of the PREPG (the 

PREPOBJ). As with clauses, the situation may be more complex. 

For example. In a sentence like "Who was the antelope I saw you 

with last night?", the SMOBJ of the PREP "with" Is the question 

element "who" In the MAJOR CLAUSE. However, the PREPG 

specialist (SMPREP) takes care of all this, and In defining a 

preposition, we can deal directly with the SHSUB and the SM0B1. 

Notice that If we had been defining "above" Instead of "under", 

everything would have been the same except that the relation 
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would have been <#ABOVE #1 #2) Instead of (#ABOVE tt 01). If 
the PHEPG Is an adjunct to a CLAUSE, the SMSUBJ Is the RSS 
deflnlns the CLAUSE. The definition of a preposition can then 
use the semantic markers which are Included In an RSS, 
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h.2.ti Ivpes a£ Object pp sgr I pi; Ions 

In the examples so far, all of the objects described have 
been singular and INDEFInlte, like "a. red cube", and the 
semantic system has been able to assign them a PLANNER variable 
and use It in building their properties Into the description. 
Let us consider another simple case, B DEFinite object, as In "a 
red cube which supports the mfJflUil « 

The analysis begins exactly as it did for the earlier case, 
building a description of "red cube", then one of "pyramid." 
The "pyramid" description differs from nSS 2 In having. DEF In 
place of INDEF In Its determiner. This Is noted at the very 
beginning of the analysis, but has no effect until the entire NG 
(Including any qualifiers) has been parsed. At that time, the 
second NG specialist SMNG2 checks for a definite NG and tries to 
determine what It refers to before going on (we have pointed out 
In various places how this Is used to guide the parsing). It 
takes the PLANNER description which has been built up, and hands 
It to PLANNER In a THFIND ALL expression. The result Is a list 
of all objects fitting the description. Presumably if the 
speaker used "the", he must be referring to a particular object 
he expects the listener to be aware of. If more than one object 
fits the description, there are various discourse heuristics 
used to find the reference, (see Section 1..3.3) and if nothing 
succeeds, a failure message Is produced and the parser has to 
back up and try something else to parse the NG. 
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If SMNG2 Is able to find the object being referred to. It 
puts It Into the description (on the property list). When SMRSQ 
relates the descriptions to build the meaning of "a red cube 
which supports the pyramid" It takes advantage of this. The 
object found will have a proper name like :B5. Instead of 
building the PLANNER description of OSS 3, It builds: 



((XI) 200 (THGOAKMS $?X1 *BL0CK>) 
(*EQDIM S?X1) 
(THGOALUSUPPORT S?X2 :B5))) 

Figure 65 -- PLANNER Description 1 
a red cube which supports the pyramid" 



The object Itself Is used In the relation rather than dealing 
with Its description. 

What If we had asked about "a red cube which supports three 
pyramids"? In that case the PLANNER description would Include 
an expression using the function THFIND with a numerical 
parameter, as shown in Figure 66. If we had said "a red cube 
which supports at most two pyramids", a fancier THFIND parameter 
would have been used, as shown. Here, the parameter means "be 
satisfied If you don't find any, but If you find 3, Immediately 
cause a failure." In addition to numbers, the SMNG1 and RSQ 
programs can work together to relate descriptions of quantified 
objects. "A red cube which supports iflmj. pyramid" Is handled 
just like the original Indefinite case. "A red cube which 
supports no. pyramid" and "a red cube which supports every 
pyramid" are handled using the other PLANNER primitives. A 
universal quantifier is translated as "there Is no pyramid which 
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(THGOALCIS *?X2 'PYRAMID)) 
(THGOAK 'SUPPORT t?Xl J?X2>> 

"which supports a pyramid" 

(THGOAU'SUPPORT $?X1 :B3)) 

"which supports the pyrairld" 

{THFIND 3 $?X2 (X2) CTHSOAUffU [JJM JJJWSlSUn 

(THGOAIUSUPPORT $?X1 $?X2))) 

"which supports three pyramids" 
(THF.HD <0 5 .IU |t» <**> <^0AUJ,S p m2 mi«ig ) 
"which supports at most two pyramids" 



<TH (THPR0G (X2> (THGOALCIS $?X2 WMHID)) 

(THGOAIUSUPPORT $?Xl $?X2))))) 

"which supports no pyramids" 



(TH (TMPR0G (X2) (THGOALUIS S?X2 'PYRAMID)) 

<THG0AL( 'SUPPORT $?X1 $?X2>)))) 
"which supports every pyramid" 

Figure 66 -- Quantifiers 
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the red cube does not support". For the robot, "every" means 
"every one I know about". This Is not a requirement of PLANNER, 
or even of the way we have set up our semantic programs. It was 
done as a convenience, and will be changed when the system Is 
expanded to discuss universal statements as well as the specific 
commands and questions It now handles. 

We similarly handle the whole range of quantifiers and 
types of numbers, using the logical primitives and THFIHD 
parameters of PLANNER. The work is actually done In two places. 
SHNG1 takes the words and syntactic features, and generates the 
"determiner" which was one of the Ingredients of our semantic 
structure for objects. The determiner contains three parts. 
First, the number Is either NS (singular, but not with the 
specific number "one"), NPL (plural with no specific number), 
NS-PL (ambiguous between the two, as In "the fish"), or a 
construction containing an actual arithmetic number. This can 
either be the number alone, or a combination with ">", "<■', or 
"exactly". Thus the two NGs "at most two days" and "fewer than 
three days" produce the identical determiner, containing "(< 
3)". The second element of the determiner Is either OEF, 
INDEF, ALL, NO, or NDET (no determiner at all -- as In "We like 
illfififi.") The third Is saved for the question types HOWMANY and 
WHICH, so It Is NIL In a NG which Is not a QUEST or RFL. 
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Number 

NS 

NPL 

7 

{> 2) 

(< 5) 

(EXACTLY 2) 

Determiner 

DEF 

ItiDEF 

ALL 

NO 

NDET 



Question Marker 

HOWMANY 
WH I CH 



an apple 

some thoughts 

seven sisters 

at least three ways 

fewer than five people 

exactly two minutes 



the law 

a riot 

every child 

nothing 

good Intentions 



how many years 
which road 



Figure 67 -- Examples of Determiner F.lements 



Other specialists such as SMRSQ and the answering routines 
use this Information to produce PLANNER expressions like the 
ones described above. In addition, there are special programs 
for cases like the OF NG, as In "all fi£ your dreams". in this 
case, the PREPOBJ following "of" Is evaluated as a NG first. 
Therefore In "three of the blocks", we analyze "the blocks" 
first, and since It Is definite, PLANNER Is called to find out 
what It refers to. It returns a list of "the blocks", (e.g. 
(:B1 :B4 :B6 :B7>). The OF specialist uses the PLANNER function 
THAMONG (which chooses Its variable bindings from "among" a 
given list) to produce an expression like: 
(THFIND 3 J?X1 (XI) (THAMONG XI (QU0TE(:B1 : B4 :B6 :B7))>) 
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Ordinals are treated specially, along with superlative 
ADJectlves. If we have a NG like "the biggest block which 
supports a pyramid". It Is Impossible for SMNG1 to add the 
meaning of "biggest" to the description In the same way as It 
would add an expression for "bis". The block Is "bluest" with 
respect to a group of objects, and that group Is not fully 
defined until the entire NQ has been parsed. Including thp 
qualifiers. SMHG1 therefore does a partial analysis of the 
meaning, looking up the name of the measure that particular 
adjective refers to, then hangs the result In the last niche of 
the OSS described In section 1..2.1 After all has been parsed, 
SMNG2 finds It there and creates a full logical description. In 
the case of "the biggest block which supports a pyramid", we 
would get the PLANNER description: 



((XI X2 X3 XU ) 2(10 

(THGOALCMS $?X1 #8lOCK)) 

(THGOAL(#IS S?X2 #PYRAMIO) ) 
»THGOAL(#SUPP0RT $?X1 S?X?)) 
(THNOT 

(THAND<THGriALUlS $?X3 #BL0CK>) 
(THGOALUIS S?X1| *PYRAf!lD>> 
(THGOALUSUPPORT $?X3 $?X*)> 
CTHGOAUIMORE *SIZE $?X3 !t?Xl))>)) 

Figure 68 — PLANNER Description 2 
Enc biggest block which supports a pyramid" 



A similar type of description Is generated for other 
superlatives and ordinals. 
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ii.2.5 The Meaning Ql ClUCSt'oPS 

So far, we have discussed the semantics of objects and the 
relationships which are used to describe them In preposition 
groups and relative clauses. Now we will deal with the overall 
meaning of a sentence as an utterance -- as a statement, a 
question, or a command. The sentence Is analyzed Into a 
relationship semantic structure, and the system must act on It 
by responding, taking an action, or storing some knowledge. 

First let us look at questions. In describing the grammar 
of clauses (see section 2.3.3) we pointed out the similarities 
between questions and relative clauses, which share a large part 
of the system network and the parsing program. They also have 
much In common on a semantic level. We can look at most 
questions as being a relative clause to some focus element In 

the sentence. 

In the class of WH questions, this resemblance Is easy to 
see. First we can take a NGQ question, whose question element 
Is a NG. The question "Which red cube supports a pyramid?" Is 
very closely related to the NG "a red cube which supports a 
pyramid. The system can answer such a question by relating the 
clause to the object, and building a description of "a red cube 
which supports a pyramid." It then takes this entire PLANNER 
description and puts It Into a THFIND ALL statement, which Is 
evaluated In PLANNER. The result Is a list of objects fitting 
the description, and Is In fact the answer to our question. Of 
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course PLANNER might find several objects or no objects meeting 
the description. In this case we need answers like "none of 
them" or "two of them". Section b.U describes how responses to 
questions such as these are Generated, depending on the relation 
between the specific question and the data found. If the 
question Is "how many" Instead of "which", the system goes 
through the Identical process, but answers by counting rather 
than naming the objects found. 

No matter what type of NGQ we have (there Is a tremendous 
variety -- see section 2.3.3) the same method works. We treat 
the MAJOR clause as a relative clause to the NG which Is the 
question element, and which we call the focus. This Integrates 
the relationship Intended by the clause Into the description of 
that object. PLANNER then finds all objects satisfying the 
expanded description, and the results are used to generate an 
answer. 

Next, we have the QADJ questions, like "when", "why", and 
"how". In these cases the focus Is on an event rather than on 
one element of the relation. If we ask "Why did you pick up a 
block?", we are referring to an event which was stored In the 
system's memory as ('PICKUP E23 :B5) where iBS Is the name of 
the object picked up, and E23 Is the arbitrary name which was 
assigned to the event (see Section 3.b for a description of the 
way such Information Is stored.) We can ask In PLANNER; 
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(THF1ND ALL S7EVENT (S7EVENT S?X) 

(THGOALHPICKUP S7EVENT $?X>> 
(THGOALUIS S?X *BLOrK)>) 

In other words, "Find all of the events In which you picked up a 
block." This Is clearly the first thing which must be done 
before we can answer "why". Once It has been done, answering Is 
easy, since PLANNER will return as the value of THFIHO a list of 
names of such events. On the property list of an event we find 
the name of the event for which It was called as a subgoal (the 
"reason"). We need only to describe this In English. Similarly 
If the question Is "when", the property list of the event gives 
Its starting and ending tines. If the question Is "how" It 
takes a little more work, since the subgoal tree Is stored with 
only upward links. But by looking on the EVENTLIST, the system 
can generate a list of all those goals which had as their reason 
the one mentioned In the sentence. 

This concept of a relation as a sort of object called an 
"event" Is useful In other parts of the semantics as well -- for 
Instance In dealing with embedded clauses as In "the block which 
I told you to pick up". This Is described In section 4.2.12. 
"Where" is sometimes handled differently, as It may be 
either a constituent of a clause such as a location object 
(LOBJ) (In "Where did you put lt7") or an ADJUNCT (as In "Where 
did you meet him?"). The first case Is handled just like the NG 
case, making the clause a relative, as If It were "the place 
where you put It", then asking In PLANNER: 
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(THFlND ALL S7PLACE (PLACE EVENT) 

(THGOAL ('PUT S7EVENT lOBJ S7PIACE))) 

The ADJUNCT case Involves thinking about a special -LOCATION 
assertion, as In: 

CTHFIND ALL J7PLACE {PLACE EVENT) 

(THGOALE'MEET $?EVENT :Y0U :HIM)) 
(THGOAL ('LOCATION J7EVENT S7PLACE))) 

In this example, we have moved away from the BLOCKS world since 
It does not yet contain any actions In Its vocabulary which 
occur at a specific place without that place being mentioned In 
the event, such as *PUT. However the semantic system Is 
perfectly capable of handlfng such cases. 

So far, we have seen that we can answer WH- questions by 
pretending they are a relaclve to some object, event, or place, 
and by adding the relationship to the description of this focus. 
It Is an Interesting fact about English that even In a YES-NO 
question, where there Is no question element there Is usually a 
focus. Consider a simple question like "Does the box contain a 
block?" Someone might answer "Yes, a red one.", as If the 
question had been "Which block does the box contain?" Notice 
that "Yes, the box." would not have been an appropriate answer. 
Something about "the box" makes It obvious that It Is not the 
focus. It Is not Its place as subject or object, since "Is a 
block In the box?" reverses these roles, but demands the same 



answer. Clearly It Is the fact that "a block" Is an INDEFlnlte 
NG. 

The fact that a speaker says "a block" Instead of "the 
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block" Indicates that he Is not sure of a specific object 
referred to by the description. Even If he does not Inquire 
about It specifically, the listener knows that the Information 
will be new, and possibly of Interest since he mentioned the 
object. In answering "Does the box contain a block?", our 
system does the same thing It would do with "How many blocks 
does the box contain?". It adds the rplatlon "contained by the 
box" to the description of "a block", and finds all of the 
objects meeting this description. Of course the verbal answer 
Is different for the two types of question. In one case, "Yes" 
Is sufficient, while In the other "one" Is. But the logical 
deduction needed to derive It Is Identical. In fact, our system 
uses this extra Information by replying, "Yes, two of them: a 
red one and a green one." This may sometimes be verbose, but In 
fact gives a natural sound to the question-answering. It takes 
on the "Intelligent" character of telling the questioner 
Information he would be Interested In knowing, even when he 
doesn't ask for It explicitly. 

In YES-NO questions. It Is not always easy to determine the 
focus. Only an INDEF NG which Is not embedded In another NG can 
be the focus, but there may be several of them In a sentence. 
Sometimes there Is no way to choose, but that Is rare. In 
asking a question, people are usually focusing their attention 
on a particular object or event. There are a number of devices 
for Indicating the focus. For example a quantifier, like "any 
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ase 



or a TPRON like "something" emphasizes the NG more than a slmpl 
determiner like "a". In both "Does anything green support a 
block?", and "Does a block support anything green?", the phr 
"anything green" Is the focus. When none of these cues ar 
present, the syntactic function of the NG makes a difference. 
If we ask "Is there a block on a table", then "block" Is the 
focus, since It Is the subject while "table" Is Inside a PREPG. 
Our system contains a heuristic program which takes Into account 
the kind of determiners, number features (singular Is more 
likely than plural), syntactic position, and other such factors 
In choosing a focus. If It Is In fact very difficult to choose 
In a given case. It Is likely that the speaker will be satisfied 
wl th any choice. 

For sentences In the past tense, which contain no focus NG, 
we can again have an event as a focus. If we ask, "Did Jesse 
James rob the stagecoach?", a possible answer. Interpreting the 
event as the focus. Is "Yes, three times: yesterday, last week, 
and a year ago." This is closely parallel to answering questions 
In which the focus is an object. 

There are some questions which have no focus, such as 
present-tense clauses with only definite noun groups. These, 
however, are even easier to answer, since they can be expresssed 
In the form of a simple set of assertions with no variables. 
The NG analysis finds the actual objects referred to by a 
definite NG, and these are used In place of the variable In 
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relationships. We can therefore answer "yes" or "no" by making 
a goal of the relationship and letting PLANNER evaluate It. The 
question "Does the red cube support the box?" would generate the 
simple PLANNER expression 

(THGOAL {'SUPPORT : B3 :B0X)) 
If :B3 Is the Internal name for the red cube. PLANNER would 
return a non-NlL value only If the answer were "yes". 



Section U.2.6 - Pace 331 



^ ■ Z . 6 Interpreting Imoerat li/Pft 

The system can accept commands In the form of IMPERATIVE 
sentences. These are handled somewhat differently from 
questions. If they contain only definite objects, they can be 
treated In the way mentioned above for questions with no focus. 
The command "Pick up the red ball.". Is translated Into the 
relationship (#PICKUP :B7) which can be evaluated directly by 
putting It In a THGOAL statement which will carry out the 
action: 

(THGOAL UPICKUP :B7)(THUSE TC-PICKUP)) 
However, If wc say "Pick up & red ball.", the situation Is 
different. We could first use THFINO to find a red ball, then 
put this object In a simple goal statment as we did with "thm 
red ball". This, however, nl*ht be a bad Idea. In choosing a 
red ball arbitrarily, we may choose one which Is out of reach or 
which Is supporting a tower. The robot might fall or be forced 
to do a lot of work which It could have avoided with a little 
thought. 

We want to send the theorem which works on the goal a 
description rather than an object name, and let the theorem 
choose the specific object to be used, according to the criteria 
which best suit It. This Is the method w e have adopted. 
Remember that each OSS has a name like "NGU5". Before a clause 
Is related to Its objects, these are the symbols used In the 
relatlonshi p. 
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When we analyze "Pick up a red ball". It will actually 
produce <#PICKUP NGH5), where MRUS names an OSS describing "a 
red ball." We use this directly as a goal statement, calling a 
special theorem which knows how to use these descriptions. The 
theorem calls a theorem named TC-FINDCHOOSE, which uses the 
description of the object, along with a set of "desirable 
properties" associated with objects used for trying to achieve 
the goal. IPICKUP may specify that It would prefer picking up 
something which doesn't support anything, or which Is near the 
hand's current location. Each theorem can ask for whatever It 
wants. Of course. It may be Impossible to find an object which 
fits all of the requirements, and the theorem has to be 
satisfied with what It can get. TC-FINDCHOOSE tries to meet the 
full specifications first, but If It can't find an object (or 
enough objects In the case of plural). It gradually removes the 
restrictions. It must always keep the full requirements of the 
description Input In English In order to carry out the specified 
command. The robot simply tries to be clever about choosing 
those objects which fit the command but are also the easiest for 
I t to use. 
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^ 2 -7 Accepting Declarator Informatlnn 

In addition to questions and commands, the system can 
accept declarative sentences. We have Intentionally not 
emphasized them, as there are theoretical problems and dangers 
In designing a program to accept Information In this way. In 
Chapter 3, we discussed the complex world-model a person has and 
explained why we felt that Intelligence needed a highly 
structured and coordinated body of knowledge rather than a set 
of separate uniform facts or axioms. It is comparatively easy 
to get a program to add new Information of the second type, but 
very difficult to get It to add the first, since this Involves 
understanding the relationship between the new Information and 
whatever Is already there. 

Therefore, although we have Included declarative sentences 

iSi?^ d J; 1 ;\ < ; nd the v re fu ' ,v hand,ed ,n the eraiUi", cr 

believe that before trying to "tell" many things to a program 
we need to have a better Idea of how knowledge should be 
nS r !^? the program should approach new Information as a 
problem solving activity rather than a clerical one. 

When a human sees a new sentence, he does not simply store 

It away, but he relates It to what he already knows, perhaps 

changing his "programs", or Ignoring the content of the sentence 

and Interpreting something about the person who said It. A 

language understander needs to have an Interpreter which looks 

at each new sentence and decides how to use It. This may 

Include checking It for consistency with what It already knows, 

creating new data or types of data In Its storage, modifying 

theorems, and many other possibilities. This Is discussed 
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further In Section 5.1 

In our system we have four different ways In which 
Information can be accepted In a declarative sentence. The 
first Is a simple word definition facility. If we say "A 'marb* 
is a red block which Is behind a box.", the system recognizes 
that we are defining a new word. It currently recognizes this 
by the quote marks, but It could just as easily declare all 
unfamiliar words as possible new words. We have not done this 
as It would eliminate the feature that the system Immediately 
recognizes typing errors without waiting to be?ln parsing the 

sentence. 

In this kind of definition, the complement of the sentence 
is a noun group, which has an OSS. We save this OSS and 
generate a new dictionary entry for the word, defined 
syntactically as a noun, and with Its semantic definition being 
the program "set the object description to this one we saved 
earlier." Remember that all definitions are programs, so this 
one fits In with no problem. When It is called on to build part 
of the description. It simply Inserts the description used to 
define It. if we talk about "two big marbs", the system will 
build a description exactly like the one for "two big red blocks 

which are behind a box." 

The second kind of Information the system accepts Is simple 
assertions Involving a predicate for which It does not have 
complete knowledge. As we mentioned In Section 3.1, the system 



Section li.2.7 - Page 335 



has complete data about the physical characteristics of the 
objects In the scene. We have selected HIKE as an arbitrary 
relation about which the system knows nothing except what It Is 
told in the dialog. If we say "I like you." this produces the 
assertion (#LIKE :FRIEND :SHRDLU) (the name of the robot Is 
:SHRDLU) which Is simply added co the data base. The system 
also plays a trick with the adjective "nice". Instead of having 
some concept of 'NICE, It assunes that the use of "nice" In 
describing something Is really saying more about the speaker 
than the object, so the definition of "nice" Is 

(NMEANSU*THIKG>(ULIKE :FRIENP •••)))) 
In other words, the person who uses the word "nice" likes the 
object he was referring to. 

If we use an object which Isn't definite, as In "I like red 
blocks.", the system uses the object description to generate a 

simple PLANNER consequent theorem. It creates a theorem of the 
form: 

(THCONSE (XI) 
('LIKE :FRIEND S?X1) 

(THGOAL (#IS S?X1 'BLOCK)) 
(THGOAL ('COLOR $?X1 'RED))) 

This theorem says "Whenever you want to prove that the user 
likes something, you can do It by proving that It Is a block and 
it is red." This Is added to the theorem data base, and can be 
used to answer questions or carry out deductions Involving 
objects described as "nice". The system does not separate types 
of non-definite objects and assumes universal quantification. 
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The results would have been the same if the sentence used "any 
red block", "every red block", "all rod blocks", or (wrongly) "a 
red block." A more complete treatment Is one of the possible 

extensions of the system. 

It does notice the form "no red blocks" and uses this for 
the fourth kind of Information. It sets uo an almost Identical 
theorem, but with a "kicker" at the end. if we say "I like no 
red blocks,", it sets up the theorem: 

(THCONSE (XI) 
(#LIKE :FRIEMD S?X1) ini _,. 

(THGOAL CIS $7X1 #BL0CK)> 

(THGOAL (*COLOR S?X1 »RED))) 

(THFAIL THGOAL)) 

When the system Is trying to prove that we like something, 
this theorem Is called just like the one above. But this time, 
after It finds out that the object is a red block. It does not 
succeed. Instead, It uses the PLANNER function THFAIL In a 

powerful way. Instead of just causing that theorem to fail. It 
causes the entire goal to fall, regardless of what other 
theorems there are. We can also accept a sentence like this 
with a positive NG but a negative clause, as In "I don't like 
the red block" or "I don't like any red blocks." 
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h -2-S Time 

One of the most complex parts of English semantics Is the 
way of establishing temporal relationships. It was pointed out 
earlier that one of the primary differences between the clause 
and other units such as the NG or PREPfi Is the special set of 
mechanisms within the clause for- handling time. In this section 
we will describe how those mechanisms operate both within the 
clause and at other levels of syntax. 

In our formalism for describing relations and events (see 
section 3.1) there Is provision for Including a time reference 
In a relation. The sentence "Harriet saw the film last week." 
might be represented as 

('SEE :HARRIET : F I LM ITIME23) 
where ITIME23 Is an arbitrary name for a structure describing 
the time reference "last week". The semantic programs for 
dealing with time can be described In three parts -- the form of 
structures used to represent time, the way those structures are 
created, and the way they are used In understanding and 
deduction. 

A. Time Semantic Structures 

For the purposes of our BLOCKS world, we have treated only 
a simple part of the overall range of time references In 
English. In particular we have dealt only with references to 
actual events which hove happened In the past or are occurring 
In the present, without dealing with the many varieties of 
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future events, possible events, conditional events, etc. With 
this simplification the system can use a simple linear time 
scale (like a clock), relating all events to specific numerical 
times. This does not mean that a single event must occur at a 
single time — It may continue for a period of time during which 
other events are occurring. 

English makes a clear distinction between events which are 
thought of as occurring at a particular time, and those which 
are pictured as continuing over an Interval. This contrast Is 
expressed both in the choice of verbs and In the shape of the VG 

containing the verb. 

Verbs like "like", and "know", are Inherently progressive. 
They express a relationship which continues over a period of 
time. Verbs like "hit", and "write" are not progressive, but 
indicate the completion of an action as a whole. Of course, 
this action also Involves a process, and there Is a way to 
express this aspect by using a tense PRESENT IN... The sentence 
"I broke It." Is not progressive, giving the feeling of a single 
momentary act. "I was breaking It." emphasizes the process of 
breaking, to which other events can be related. 

In the present tense, the distinction Is clear. The 
present of a progressive verb has the expected meaning, as In "I 
know your name." With a non-progressive verb, there Is a special 
meaning of habitual or repeated action, as In "I break bottles." 
In order to produce the meaning usually considered "present". 
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the verb group must be PRESENT IN PRESENT, as in "I am breaking 
bottles." 

Ambiguities can arise from verbs which are both progressive 
and non-progressive. The question "Did the red block touch the 
green one while you were building the stack?" has two 
Interpretations. One means "Was It In contact during that 
time?", while the other asks "Old it make contact during that 
time?" If the verb were replaced by "support", only the analog 
of the first meaning would be valid, while "hit" would Involve 
the second. The representation for time references must take 
this progresslvlty into account In trying to Internret time 
modi f lers. 

The representation used for time has four elements: the 
tense, an Indicator for progressive, a starting time limit, and 
and ending time limit. Either or both of the limits may be 
omul t ted. Some examples of sentences and their corresponding 
structures are shown In Figure 69. 



A supports B ( PRES) T N0W 

A supported B before time 23 (PAST) T H\ L 23 

0« h 'J B before time 23 (PAST) NIL NIL 23 

You built It after time 2k (PAST) NIL 2k NIL 

You were building It after time 2k (PAST) T 2k NIL 

Figure 69 -- Time Semantic Structures 



The difference between the last two exemnles In Figure 69 
can be visualized by drawing a time line: 
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Non-Progressive 

You built It after time 2U 

i I • 

time begin 

2ii building 



Progressive 
You were bulldlne It after time 2«t 



H 



time 
21. 



finish 
bulldln 



Figure 70 -- Progressive end non-Progressive Times 



A non-progressive action must begin after the start time, 
and end before the end time. A progressive one begins before 
the start time and ends after the end time. The TSS for "you 
hit It during event 23" (assuming event 23 began at time 3 and 
ended at 7) would be 

(PAST) NIL 3 7 
i.e. the hit began after event 23 started and ended before It 
ended. The sentence "you were hitting It during event 23" would 

be: 

(PAST) T 7 3 
I.e. the hitting began before event 23 was over, but ended after 
It had begun. This covers all ways of having the two events 
overlap. The definitions of the relating words like "during" 
and "before" do not have explicit mention of this distinction, 
but the semantic analysis programs take Into account whether the 
verb and VG are progressive In setting up the TSS. 
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B. Setting up Time Structures 

Time Semantic Structures are associated with clauses, and a 
new one Is generated each time a clause Is parsed. Its elements 
are determined by different aspects of the clause structure -- 
the tense depends on the form of the VG, the progresslvl ty 
depends on both the tense and the specific verb, and the limits 
are set by modifiers such as bound clauses, adverbs, and time 
NGs as well as by the tense. 

No analysis Is done until after the VG Is parsed and the 
tense established. Some types of secondary clauses such as IMG, 
SUBING, TO, and SUBTO do not Indicate a tense. There Is a 
potential ambiguity In determining the time reference. "The man 
sitting on the table baked the bread." might Indicate that the 
man was sitting on the table when he baked It, or that he Is 
sitting on the table now. 

Unless there Is a specific reference dike "the man sitting 
on the table yesterday...") the system should take both 
possibilities Into account and resolve them as It would an 
ambiguity caused by multiple senses of words. The current 
system does not do this, but uses a simplifying heuristic. If 
the secondary clause involves PAST, and Is embedded In a PAST 
MAJOR CLAUSE, the two times are assumed the same unless 
specifically mentioned. If the secondary clause has no tenst. 
It Is assumed PRESENT. If It Is PAST, but Imbedded In a PRESENT 
MAJOR CLAUSE, the system checks the time reference of the 
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previous sentence. If this Is PAST, the new one Is assumed to 
be the same (Including whatever modifiers, limits, etc. 
applied). If not It sets up a general time structure for PAST, 
with no beginning Unit, and an end limit of :NOW. A PRESENT 
tense TSS Is represented by the single atom :MOW, which Is 
treated specially by the programs, and Is often deleted from 
relations which Interrogate the current state of the data base 
(see below). It can be applied only to progressive verbs and 
tenses (no provision exists for understanding habitual action). 

Modals are treated like present terse as far as 
establishing time references. A more complete system would 
account for future, different types of modals, more complex 
tenses, and would Involve heuristics for finding the referents 
of multiple tenses like "He will have been going to go 
Immediately for a month by Tuesday." 

The start and end limits are set by modifiers. Adverbs 
like "yesterday" and TIME NG's like "the week he arrived" set 
both limits. This can also be done by bound clauses like "while 
you were building the stack" or PREPGs like "during the flood". 
Other clauses, prepositions, and groups set only the start limit 
(like "after you hit It", "after the war") while others (like 
"before" and "until") set the end limit. In the current system 
the event being referred to In the modifier Is assumed to be 
known along with Its exact time (It must be In the past.) The 
exact beginning and ending time are used In setting the limits. 
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The question "Did you pick It up while you were building 
the stack?" Is answered by first finding the event of building 
the stack (using a TSS for PAST tense with no other limits), 
then using the beginning and ending of that event as limits for 
the TSS In the relation #PICKUP. 

There are discourse phenomena which Involve time reference. 
Plrst, there are specific back-references with words like "then" 
and phrases like "at that time". The system keeps track of the 
major time reference of the previous sentence, and substitutes 
it In the current sentence whenever such phrases are used. This 
time Is also carried forward Implicitly. Consider "Did you pick 
up a red block while you were buttdlng the tower?" "No." "Did 
you pick up a green one?" In this sequence, the second question 
Involves a specific time Interval although It Is not mentioned 
again. Whenever there are two successive PAST sentences and the 
second does not have any explicit time reference, the previous 
TSS Is used. Long dialogs can appear In which the same time 
Interval Is used throughout, but Is mentioned only In the first 
sentence. 

C. Use of TSS 

So far, all of our discussion has Involved the clause with 
Its verb group and time modifiers. But In making use of time 
Information we must handle other units as well. The sentence 
"The man sitting on the table baked the bread." has two 
meanings, but the point would have been Identical for "The man 
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on the table baked the bread." The Qualifying prepositional 
phrase "on the table" does not refer to time, but can be 
Interpreted either as meaning "on the table now" or "on the 
table then". Adjectives can be affected similarly. Consider 

the sentences: 

a. Many rich men made their fortunes durlne the depression. 

b. Many rich men lost their fortunes during the depression. 

c. Many rich men worked In restaurants during the depression. 
The first clearly means "men who are now rich", the second 

"men who were rich", and the thErd ml E ht have either 
Interpretation. The adjective "rich" Involves an Implicit time 
reference, as does any adjective which describes a state which 
can be true of an object at one time, but false at another. 
Nouns can also Involve states which are changeable, and the 
problem would be Identical If "rich men" were replaced by 

"millionaires". 

In a traditional transformational approach, this would be 

used to show that even a simple phrase such as "a rich man" or 

•'millionaires" Is generated by a series of transformations. The 

possibility of two meanings Is accounted for by two different 

deep structures. Involving sentences corresponding to "The men 

were rich." and "The men are rich." This leads to a syntactic 

theory In which the simplest sentence may Involve dozens of such 

transformations, to account for each noun, adjective, 

presposltion, etc. The parser must be able to handle all of 
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these details using syntactic Information. 

In our approach, these can be seen as semantic ambiguities 
which arise within a single syntactic structure. Part of the 
semantic definition of the word "ml 1 1 lonal re" (or "student", 
"bachelor", etc.) Involves a reference to time. Within the 
language for writing semantic definitions, there Is a special 
symbol *TIME. Whenever the program for the meaning of a word In 
the dictionary Is called, the semantic system will have 
determined the appropriate Time Semantic Structure (or 
structures) and have assigned a value to this symbol 
accordingly. If the time reference Is ambiguous, the definition 
will be called once for each possibility. The nnun millionaire" 
might be defined: 

(NMEANS < ('PERSON) (CIS **• #PERS0N) 

(•POSSESS •»• $1,000,000 -TIME)))) 

Notice that not every relation Involves time. Being a 

'PERSON Is assumed to be a permanent characteristic. If the 

time Is PRESENT (Indicated by the TSS (NOW), the system deletes 

the time reference, so PLANNER will receive the expression 

(THGOAL ('POSSESS $?X1 $1,000,000)), where $7X1 Is the variable 

assigned to the object being described. If the sentence were 

"During the war, many millionaires worked In restaurants.", the 

time reference of the sentence would be a structure like ( 

(PAST) NIL 19U1 19U5), and the PLANNER expression for 

"ml 1 llonalre" would Include: 

('POSSESS $7X1 $1,000,000 ((PAST) NIL 1941 191.5)) 
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A different theorem would be used for this case, since It cannot 
look directly Into the data base to see what the person has, but 
must look Into Its past "records" to reconstruct the 
information. In our programs, o record Is kept of when and 
where objects have been moved, so theorems can determine the 
location of any object at any tine In the past. 

Since adjectives can be defined with NMEANS, they are 
treated Identically. PREPosltlons and verbs are usually defined 

Ith CKEAN5, which has the same conventions. The symbol -TIME 
can appear In the PLANNER description In the definition, and Is 
deleted If the applicable time Is I NOW, and replaced with the 
TSS otherwise. The time applicable to anything but a clause Is 
that of the clause closest above it In the parsing tree. This 
is only an approximation, and does not take Into account 
ambiguities such as Illustrated In sentence c. above. In fact, 
a PREP or NG can have Its own tTme reference, as In "a former 
millionaire", "many future students", "my roommate laai y^ax", 
"the man on the table yesterday ". This Is one of many places 
where the current semantic system needs to me extended by making 
the analysis more general. It seems that this could be done 
within the framework of the current system. 
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JuU Semantics a£ Conjunct Inn 

The semantic system does not handle conjunction as 
generally as does the parser. A few cases have been dealt with 
In a simplified way — noun groups, adjectives, RSQ clauses, and 

MAJOR clauses which are not questions. The distinction between 

"and" and "but" Is Ignored. 

With MAJOR clauses, the conjunction must be "and", and the 
components are processed as If they were completely separate 
sentences, except that the response ("OK." for IMPERatlves, and 
"I UNDERSTAND." for DECLARat I ves) Is supressed for all but the 
last. The system will not accept sentences joined with "or", 
or "nor", and will misunderstand compounds which cannot be 
separated Into Individual actions (e.g. "Build a stack and use 
three cubes In It.") 

Noun groups can be connected with "and" wherever they 
appear, and with "or" If they are part of an argument to a 
command (like "Pick up a cube fl£ a pyramid."). An OSS Is built 
with the semantic markers of the first constltutent NG, the 
conjunction Itself, and a list of the OSS for the components. 
If all of the components are DEFInlte and the conjunction Is 
"and", the conjoined NG Is definite, and Its REFERent Is the 
union of the referents. 

The use of the conjoined OSS depends on Its place In the 
sentence. If It Is the object or subject of a verb or 
preposition, the definition of that verb or preposition can 
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check explicitly for conjoined structures and treat them 

specially. For example, "touch" can be defined so that the 

sentence "A and B are touching." will be represented as (THGOAL 

UTOUCH :A :B>). If there Is no special check, the system 

assumes that the desired object Is the list of referents. "A 

and 6 support C." would produce (THGOAL (•SUPPORT (:A :B) :C>>. 

If the first element of the PLANNER expression (usually the name 

of a predicate) has a property MULTIPLE on Its property list, 

the system modifies this to create the expression: 

(THAND(THGOAL('SUPPORT :A :C>> 
(THGOAL(*SUPPORT :B :C>>) 

If the conjoined NG Is one of the arguments to a command, 
the theorem TC-CHOOSE will choose the specific referents. If 
the conjunction Is "and". It will combine the referents for each 
of the components In a single list. If It Is "or". It will 
first choose according to the first constituent, then If a 
failure backs up to the choice. It will try the second, third, 
etc. It does not look at the various choices In advance to 
decide which Is most appropriate for the task being done. 

The other units which can be combined with "and" and "or" 
are the adjective and RSG clause. The semantic structure for 
the conjoined unit Is a list whose first element Is the 
conjunction, and the rest are the Individual interpretations for 
the constituents. In using these to modify an OSS, the system 
combines all of the descriptions with THOR or Implicit THAND. 
For example, "a block which Is In the box 2£ri Is red" becomes: 



Section h.2.9 - Page 3U9 



<THGOAL(»IS $?X #BLOCK)) 
<THGOAL(#IN $?X :BOX)) 
(THGOAL(*COLOR $?X #RED>) 

while "a red ox green block" becomes: 

(THGOALUIS S?X #BL0CK) ) 
(THOR(THGOAU#COLOR $?X #REO)) 

(THGOALUC0L0R $?X #GREEN))) 

This could easily be extended to other modifiers such as 
preposition croups. Many other types of conjunction could be 
handled without major changes to the system, usually by adding 
two bits of program. One would create a conjoined semantic 
structure appropriate to the unit, and the other would recognize 
It and take the appropriate action for Its use. 

Whenever the constituents of a conjoined structure are 
ambiguous, the resultant structure simply multiplies the 
ambiguity, taking all possible combinations of Interpretations. 



- 



" •• 
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i» . 2 . 10 More on Ambiguity 

Section U.l described how the number of Interpretations of 
an ambiguous sentence can be reduced through the use of semantic 
markers, and selection restrictions associated with verbs, 
adjectives, and prepositions. This section will describe the 
mechanism for producing multiple Interpretations, assigning 
plausibilities to them, and resolving the ambiguities through 
discourse heuristics and Interaction with the user. 

Any word of the classes ADV, ADJ, NOUN, PREP, PRON, PROPN, 
VB, CLASF, or PPT can Introduce an ambiguity Into the semantic 
Interpretation. The remaining classes (such as NL'Mber and 
DETermlner) have very limited definitions, and are handled 

dlfferpntly. 

In general, a word Is expected to produce a list of 

semantic structures, based on Its definition, and the other 

lists of semantic structures to which It Is related. NOUM, 

PRON, and PROPN set up lists of Object Semantic Structures. ADJ, 

ADV, and CLASF take one of these lists, and produce a new list 

adding the modification (and posslblly eliminating anomolous 

combinations). The VB, PREP, and PRT (In conjunction with VB) 

set up lists of Relation Semantic Structures, and other classes 

can modify these lists. 

Any of these definitions can involve special programs for 
producing the list of structures. For example the SMIT program 
Is used for analyzing pronouns like "they" and "It". It 
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contains a complex set of heuristics and syntactic criteria to 
find the possible referents of a pronoun and set up an 
Interpretation for each one. For simpler words, the functions 
NMEANS and CMEANS have ways to deal with multiple senses of a 
word. 

First, they both take as an argument not a single 
definition, but a list of definitions, each with Its own 
semantic markers, PLANNER expressions, etc. Second, each of 
them has mechanisms for looping through each of the relevant 
"Input" lists, to produce multiple Interpretations. If the 
subject and object of a clause each have two Interpretations, 
and the verb has three senses, all twelve combinations will be 
tried, and the resulting list of Interpretations for the clause 
will contain as many of them as pass through the semantic marker 
"filters". This Is all done by the function CMEANS without 
mention In the definitions. Third, the definition functions 
have optional ways to establish a "plausibility" rating and 
paraphrase for each meaning. 

In the dictionary, the word "on" has a semantic definition 
which uses the function #0N, which contains: 

(CMEANS (C((#PHYSOB>> <(#PHYS0B))) (ION #1 #2 -TIME) 

NIL 

CCCCPHYSO.,, «.PHVSOB>„ l^tST^J/S^"* 

wtmunt u.physob,,, <£T,T;,?V HE surface) " 
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This contains three different senses. The third definition 
will never conflict with the first two, since It Insists that 
the subject be a place rather than a physical object. The first 
two, however can be applied to the same objects, and therefore 
can be Involved In an ambiguity. Each of them Is given an 
additional argument, to help resolve ambiguities. This argument 
appears after the standard semantic filters, PLANNER relation, 
and semantic markers for the relation. It Is composed of two 
elements -- a LISP form to be evaluated for a "plausibility", 
and a paraphrase of the meaning. In this example, the 
plausibilities are simply numbers. They could just as easily be 
a form like (PLAUSCHECK) , which would call a special prop.ram 
{defined by the user) which could do arbitrary calculations In 
order to decide on a plausibility. This might Involve PLANNER 
deductions, checks on the exact syntax of the sentence, or other 
schemes such as looking through a network or other model In 
order to decide which meaning fits best with the the other words 
in the sentence and the subject being discussed. 

As a semantic structure is built. It takes on the sum of 
the plausibilities of its components, as Its own plausibility. 
No "pruning" Is done to eliminate obvious low plausibilities, 
although this would be a simple addition to the program. All 
interpretations are carried along until the sentence Is totally 

parsed. 

If the sentence Is a command, the system tries to carry out 
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the most plausible Interpretation. If that falls. It tries the 
next, and so on until one succeeds or a total failure causes the 
system to respond »| CAN'T". Questions are handled more 
completely. The system orders the Interpretations by 
Plausibility and finds the answer for the most plausible. It 
then tries ap.aln to answer It, using only Information mentioned 
In the previous sentence and Its answer. If | t succeeds In 
finding the same answer. It deducts 500 from the plausibility, 
since it Is unusual to ask a question to which the answer was 
Just given, either explicitly or Implicitly. If the Information 
In the previous sentence Is not sufficient to answer It, the 
system then tries to answer using only Information which has 
been mentioned previously In the discourse. If this succeeds It 
deducts 200. If the plausibility Is higher than that of the 
next Interpretation by a large enough margin (a factor set by 
the user and called TIMID) It gives the answer as found. If 
not. It saves the answer and repeats the process for the next 
Interpretation. After all Interpretations have been processed, 
the answers are checked to see If they are Identical. In this 
case It doesn't matter which Interpretation Is IntPnded, and the 
system simply E Wes the answer. Finally, If there are differing 
answers, the user must be asked what he meant. Associated with 
each Interpretation Is a list of those places where It differed 
from others. This Is produced automatically by each program 
which accepts multiple definitions (such as NMEANS and CMEANS). 
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Each difference Is marked by two atoms — one whose properties 
Indicate the place In the sentence where the ambiguity was 
produced, and the other Indicating the meaning selected. In 
sorting out the ambiguities, the system looks for two such 
structures with the same first atom but different second ones. 
The first can be used to decide what phrase Is questionable, 
while the second atoms carry the paraphrases. Special care Is 
taken to make sure that the same sentence Interpretation does 
not involve two different interpretations of a single element 

(like "It"). 

Faced with an unresolvable ambiguity/ the system looks 

through the list of Interpretations for a conflict, then 

generates a response like: 

I'M NOT SURE WHAT YOU MEAN BY "ON TOP OF" IN THE PHRASE "ON 
TOP OF GREEN CUBES ". 

DO YOU MEAN: 

1 - DIRECTLY ON THE SURFACE 

2 - ANWHERE OH TOP OF ? 

The response (a typed number) Indicates which is meant, and 
all Interpretations which Involve the other meanings (there can 
be more than 2 associated with a single ambiguity) are 
eliminated. If there arc still conflicting interpretations, 
another ambiguity Is selected and the process Is repeated until 
all those which have not been eliminated give the same answer, 
and It can be used as a response. 

For Interpreting statements, much more subtlety Is needed. 
In general the choice between Interpretations depends on how 
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"reasonable" an Interpretation Is. If we see "The book fell off 
the table because It was slanted.", we avoid believing that "It" 
refers to the book, or that It was "slanted" by the author's 
bias. It would not be reasonable for either of these to be the 
reason for falling off of a table. 

Part of the Interpreter which looks at declarative 
statements must be able to evaluate how "surprising" an 
Interpretation Is, and to choose the one which fits best Into 
the normal expectations. Including the hearer's knowledge about 
the world. This Is not easy to formalize, and was not attempted 
In this project. It could be extended to lower levels to check 
sub-units for reasonableness as they are formed, to avoid 
carrying multiple Interpretations through the analysis of the 
entire sentence. It seems clear that people do this, and a 

truly Intelligent language-understanding program must do so as 
well. 
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i4.2.ii ic Be. aai Ifi Have 

The verbs "be" and "have" are two of the most common words 
In English, and have a complex variety of uses. They appear In 
the system In two ways. First, In the grammar they are treated 
specially since they can occur as auxllllary verbs, as In "I 
would tiayj; fcejin going". In this use, they do not add any 
semantic information except for helping to determine features of 
the VG, such as Its tense. Their other use Is as main verbs In 
clauses like "Do you &&£ > match?" and "He li wrong." As a main 
verb, "be" Is handled specially In the grammar since It can 
enter Into constructions such as "there Is" which do not appear 
with any other verb. However, the semantic analyzer does not 
know anything special about "be" and "have". Their meaning is 

Included In their definitions, which are called as programs just 

like any other verb definitions. 

A. Be 

The use of "be" depends on the specific constellation of 

objects and complements In the clause. The definition Is a 
program containing about bO lines of LISP, which handles those 
meanings relevant to the BLOCKS world (for example, it cannot 
deal with a "role-playing" meaning, like "Laurence Olivier aajL 

Hamlet.") 

Sentences with the feature THERE, Involving a construction 
like "there Is" are represented by the PLANNER expression 
(#EXISTS #1 -TIME). This attaches the correct time, and might 
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be Important for objects which can be created and destroyed, as 
In "Was there a stack...?". 

The other meanings of "be" Involve INTenslve clauses which 
contain an object and a complement. One definition checks for 
COMPQ questions like "What color Is the block?", to generate a 
PLANNER expression (#COL0R :BLOCK $?«). If the complement Is a 
definite NG, as In "Is the green block Jj« biggest object ?" or 
"What Is ih£ blflzest object?", the referent will have already 
been determined, and Is Inserted In a PLANNER expression 
(THAMONG — (QUOTECOBJ))), where :OBJ is the referent. This 
can function In two ways. If the subject Is also definite, as 
In the first example, the ••• will be replaced by Its referent, 
and the statement will succeed only If the two are Identical. 
If the subject Is Indefinite, the THAMONG statement will cause 
It to be assigned to the same referent as the complement. 
If the complement Is a PREPG or a complex ADJG, like 
"bigger than a breadbox", "be" Is only serving as a place-holder 
which can accept a time reference. The semantic Interpreter In 
dealing with a phrase like "on the table" In "Is the block on 
the table?" has already set up a relation of the form (#0N 
:BLOCK :TABLE) which Includes the appropriate time reference. 
In this case, the "be" program simply takes the RSS produced for 
the complement, and uses it as the semantic Interpretation of 
the clause. 

The other possibilities for the complement are an 
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Indefinite NG, a simple ADJG (e.g. a single adjective), or a new 
word. In the case of a NG, the complement NG contains 
additional Information to be ascribed to the subject, as In "e 
large object which Is a. X£<L block". The PLANNER description of 
the complement is stripped from Its OSS, and appended to the 
PLANNER description of the subject. If the subject Is definite, 
as In "Is £hs biggest thine a red blnck7", the referent Is 
known, and can be plumed Into the PLANNER description of the 
complement to see If the description applies. This Is done 
using a pseudo-concept called fHASPROP which triggers the 
mechanisms In the semantic Interpreter. 

If the complement Is a simple ADJG, the ADJG semantic 
specialist creates Its OSS by taking the OSS for the subject, 
stripping away the PLANNER description, and using the rest as a 
skeleton on which to place the PLANNER expression produced by 
the adjective. Once this Is done, it can be treated exactly 

1 Ike an Indef Inl te NG. 

Finally, If the subject or complement Is a new word (as In 

"A frob Is a big red cube." or "A big red cube is a frob.") a 
new definition Is created using the function #OEFINE. The 
definition must be In the form of an Indefinite NG, and the new 
word Is asssumed to be a noun. The semantic definition created 
for the noun contains the OSS which was created for the def inlnj 
NG, and sets this OSS up as the meaning of the noun when It is 
used. 
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B. Have 

The definition of "have" (a also used to handle the 
possessive. For the limited subject matter (and for much of 
English) this Is a good approximation. There are cases where It 
does not apply -- "the painting which John has" Is not 
necessarily the same as "John's painting." The preposition "of" 
also makes use of the same definition. A more complete 
treatment would distinguish between the three, and this would 
Involve only simple changes to the semantic programs. 

The Interesting thing about "have" Is that It Is not used 
to Indicate a few different relationships, but is a place-marker 
used to create relationships dependent on the semantic types of 
the objects Involved. "Sam has a mother." can be represented 
CMOTHER-OF X SAM), "Sam has a friend." Is (-FRIEND X SAM), "Sam 
has a car." Is ('OWN SAM CAR), "Sam has support." Is ('SUPPORT X 
SAM), "Sam has a hand." Is ('PART SAM HAND), etc. The 
definition of "have" (or the possessive, or "of") does not 
Include within Itself all of these different relations. A few 
interpretations (like have-as-part, owning, or having In 
Physical possession) can be reasonably considered distinct 
meanings of "have", and are included In Its definition. The 
others, such as "mother" and "support" really are determined by 
the subject and object. Some systems use this fact to find the 
meaning of special phrases like "client's lawyer" without doing 
syntactic analysis (see section 2.5). Our system uses a 
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different method, allowing a word to be defined as a #R01E. 
"Mother" might be defined as: 

(NMEANS<(#PERS0N #R0LE) 

((•PERSON *••> 
(*M0THER-OF *•* ?) ,,_ 

(#R0LE((«PERS0N))(«M0THER-0F #1 #2))) 

There are two new things In this definition. First, the 
semantic marker *R0LE Is added to Indicate the type of 
definition. Second, a role definition Is Included. It contains 
a semantic filter for objects which can be used In the relation 
(In this case those which could have a mother), and a PLANNER 
statement Indicating the relation (In the same syntax used by 
CHEANS). If the word "mother" Is used In a phrase like "Carol's 
mother" or "Carol has a mother" or "the mother of Carol", the 
system will Insert the right OSS to produce the PLANNER 
description CrM0THER-0F $?X1 CAROL) If "mother" appears In any 
other form, the OSS will contain C*MOTHER-0F $?X1 ?) which will 
be satisfied In a PLANNER goal If XI Is the mother of anyone at 

all. 

Through the 'ROLE mechanism, arbitrary relationships can be 

expressed with "have" (or "of", or possesslves) without bloatln B 
Its definition. There could be more than one #R0LE assigned to 
a word as well. For example "painting" would Involve different 
roles for "Rembrandt's painting" "George Washington's painting 
by Stuart", "the Modern Museum's painting.", etc. 
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^ '2 .12 Additional Semantic Information 
A. Using Clauses as Objects 

In order to Interpret a sentence like "Find a block which 
Is taller than the one I told you to pick up." the system must 
use a clause ("you to pick up") as the object of a verb 
("tell"). It generates a pseudo-object of the type 'EVENT, and 
creates an OSS for that object. In the example mentioned, the 
clause "you to pick up" would have produced the RSS: 



C(NG1 ('PICKUP NG1 ((PAST) NIL NIL NIL)) NIL) (0)) 

reI PLANNER expression neg markers 

Figure 71 -- RSS for "you to pick up" 



NG1 is an OSS describing the object "the one", which the 
system has set up as the object of the clause, and has 
Interpreted as "block". The program SMClb takes this structure 
and produces a corresponding OSS: 



( ( UEVX1) (THGOAL ('PICKUP $?EVX1 $7X1 ((PAST) NIL NIL NIL)) 

(THUSE TCTE-PICKUP))) 
(0 'EVENT #THING) 
('THING)) 
EVX1 

(1 INDEF NIL) 
NIL) 

Figure 72 — OSS for "you to pick up" 



A variable was generated for the event, of the form EVXn, 
and a new PLANNER expression for the event was generated. 
Including the event name as the second element. The reason for 
putting It second Is technical, and should be changed someday 
for programmer convenience and consistency with the scheme 
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described In Section 2.1. In the expression, the nane of the 
OSS Is replaced with Its associated variable <ln this case $?X1> 
since the new structure will be used as part of the description 
of that object. The recommendation list includes the theorem 
which Is designed to deal with expressions Involving time and 
event-names, and Is put In by the system. In working with the 
rest of the sentence, this resultant OSS can be used just like 
any other OSS, as an object of a verb, preposition, etc. 

When PLANNER evaluates the expression. It may have the 
event already stored away, or It may have to deduce that It 
happened by looking at other events. This Is handled by the 
theorem TCTE-PICKUP, and the name of the resultant event Is the 
value which Is assigned to the variable EVX1. 
B. Types of Modification 

There are a variety of ways In which a modifier can affect 
the meaning of the phrase or clause It modifies. Since the 
definition Is a program, the user has great freedom to use 
different types of modification. A time modifier like "now" or 
"then" will modify the Time Semantic Structure associated with 
the clause, an adverb like "quickly" may set up a new relation 
such as (#SPEED S?EV1 »FAST) using the name of the event, while 
others may make changes directly to the relation being 
constructed. The semantic structures previously built can be 
analyzed and modified by an arbitrary function which suits the 
meaning of the modifier. One special facility exists for makln 
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substitutions within an expression. If the PLANNER expression 
of a CMEANS or HMEAKS definition Is of the form (#SU8ST al a2 bl 
b2...), the effect will be to modify the existing semantic 
structure by substituting the atom a2 for al, b2 for bl, etc. 
No new expression Is added to the PLANNER description. The word 
"move" might be defined using: 

(CMEANSt(((#AMIMATE))(<*MANIP))) (#PUT *2 LOC •TIME) C«M>VE)H 

This indicates that moving Is done by an animate object to a 
manlpulable object, and involves putting It at the place "LOC". 
The atom LOC would be given a OSS Indicating an unknown place. 
The resulting RSS has the semantic marker *M0VE. The sentence 
"Move a block." would create a goal ('PUT NG1 LOC), where MG1 Is 
a description of "a block". The theorem for -PUT could then 
choose a block and place. If the sentence Is "Move a block Into 
the box.", the final result should be (I PUT IN NG1 :BOX>. The 
modifying phrase makes a major change In the Internal 
representation of the meaning. 

This change can be done by defining "Into" to Include among 
1 ts meanings: 

(CMEANS((((#MOVE))((#BOX))) < ISUBST #PUTIN *M0VE *1 LOC) NIL)) 
If a PHEPG with the preposition "Into" modifies a clause with 
the semantic marker #MOVE, and the object of the preposition has 
the marker *B0X, then the definition applies. The RS5 for the 
clause Is changed by substituting #PUTIN for #M0VE, and the 
object of the preposition for HOC. The special symbols #1, »2, 
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#3, • — , and -TIME are created as they would be In a normal 
CMEANS or (MEANS definition, being replaced by the appropriate 

object. 

C. Using Evaluation In CMEANS ami NMEANS 

Although every definition has the power to use programs, 
definitions using the standard forms CMEANS and NMEANS ere 
forced Into a rather rigid syntax which does not have a 
procedural character. To avoid this, there Is an extra level of 
evaluation. If the PLANNER portion of a definition Is of the 
form (#EVAL s) where s Is any LISP atom or s-expresslon, the 
form will be EVALled before the description Is used In the 
definition, and Its value used Instead. This value will undergo 
the usual substitutions for U, '2, -TIME, etc. This feature Is 
of particular use In capturing the semantic regularities of the 
language by using auxllllary functions In defining words. Por 
example, color adjectives like "red" and "blue" share most of 
their characteristics. They apply to physical objects. Involve 
a relation with *COLOR, etc. Rather than define them 
separately, we would like a single function 'COLOR which needs 
only to have the exact color specified. The dictionary 
definition of blue would then be (#COL0R I8LUE). The function 
#COLOR can be defined In LISP: 

'TSLmtSSMSJ" 'SeVAL U.STU.ST (QUOTE .COLON 

(CAR A)))))>) 
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When (#C0L0R #BLUE) Is evaluated, the #EVAl will produce 
the form (('COLOR — 'BLUE)), which will then be used by NMEANS 
In the usual way. 

As another example, the word "grasp" can be used to mean 
♦GRASPING (an object being held) or #GRASP (the action of 
closing the fingers around It). The difference depends on 
whether the VG Is progressive or not. The function 
(PROGRESSIVE) finds out whether the clause Is progressive, by 
looking at the verb and the tense. The definition of "grasp" 
can be: 

( CMEANS ( ( t ( ' AN I MATE ) ) ( ( #MAN IP))) 

(fEVAL (COND ((PROGRESSIVE) (QUOTE* #GRASPI NG 12 -TIME))) 
(T (QUOTE (*GRASP #2 •TIME))))) NIL)) 

D. Some Interesting Problems 

There are many areas In which the semantic analysis needs 
to be refined and expanded. The system does not pretend to 
contain a complete analysis of English, but Is rather an 
Illustration of how many aspects of semantics could be handled. 
This section describes a few places where modification might 
begin. 

1. Definite Determiners 

In our system, a definite noun phrase Is Interpreted as 
referring to a unique object or set of objects known to the 
hearer. In more general language use, deflnlteness Is often 
used to convey new Information. The phrase "my brother who 
lives In Chicago" can be said to someone who Is not aware I have 
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a brother, and the effect Is to Inform him that Indeed I do, and 
to tell him where this brother lives. Other nouns can describe 
"functions", so that "the title of his new book", or "my 
address", are allowable even If the hearer has not heard the 
title or address, since he knows that every book has a unlaue 
title, and every person an address. Superlative phrases like 
"the tallest elephant In Indiana" also refer to a unlaue object, 
even though the hearer may not have seen or heard of this object 

before. 

Cases such as these can lead to problems of referential 
opacity. If your name Is "Seymour", and I say "Excuse me, I've 
never heard your name.", it does not Imply that I have never 
heard the name Seymour. The sentence "I want to own the fastest 
car In the world." does not have the same meaning If we replace 
the NG with Its current referent — I don't want whichever car 
It Is that happens to be fastest right now. 

These and other such problems need to be handled In the 
programs for Interpreting a definite NG, using syntactic, 
semantic, and world knowledge. 

2 . Verb Tenses 

The current system Implements only a few of the possible 
tenses " PRESENT, PAST, PRESENT IN PRESENT, PRESENT IN PAST, 
PAST IN PAST, and an elementary form of the MODAL can . 
Section it. 2.8 described some of the problems which can be 
Involved In time reference, and a deeper anlaysls Is needed to 
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account for Interactions between the order of phrases and the 

possibilities for time reference. The mndals, conditionals, 

subjunctives, etc. need to be handled. This may demand a 

version of PLANNER which can temporarily move Into a 

hypothetical world, or which has more power to analyze lts own 

theorems to answer questions Involving modals like "can" and 
"must". 

3. Conjunction 

Only the most elementary problems In conjunction have been 
dealt with In the current system. Many conjoined structures do 
not yet have semantic analyzer programs, and no subtlety Is used 
In deciding on the meaning of words like "and". "And" can be 
used to Indicate temporal sequence ("We wrnt to the circus and 
came home.") causality ("We saw him and understood."), as a type 
of conditional ("Do that a*aln and I'll clobber youl"). In 
specification of how to do something ("Be a friend and help 
ne."), etc. Understanding these uses will be related to the 
discourse problem of the ordering of sentences. For example, 
"The light Is on. He's there." Tndlcates a chain of reasoning. 

In addition, no attempt has been made to disambiguate 
nested structures like "A and B or C", or "the old men and 
women." Syntactic criteria are not sufficient for thes* 
distinctions, and a powerful semantic program will have to be 
used to ask "which Interpretation makes more sense In this 
case". 
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It, Non-syntactic Relations 

There are some places In English where the relation between 
a set of words Is not Indicated by syntactic clues, but Is 
largely based on semantics. One example Is chain of classifiers 
before a noun. In "strict gun law", the law Is strict, but In 
"stolen gun law", the gun Is stolen. It Is possible to combine 
long strings like "a helical aluminum soup pot cover adjustment 
screw clearance sale". In which a large amount of semantic 
Information must be combined with the ordering to find the 
correct Interpretation. The current system handles classifiers 
by assuming that they all separately modify the head. This 
needs to be changed, to use both the semantic markers, and 
complex deductions to find the real relationships. 
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i*.l The Semantics of Discourse 
In section 1.1, we discussed the different types of context 
which can affect the way a sentence Is Interpreted. In this 
section we will describe the specific mechanisms used by our 
program to Include context In Its Interpretation of language. 
We have concentrated on the "local discourse context", and the 
ways rn which parts of the meaning of a sentence can he referred 
to by elements of the next sentence. For example, oronnuns like 
"It" and "they" can refer to objects which have been previously 
mentioned or to an entire event, as In "why did you do _U?'\ 
The words "then" and "there" refer back to a previous time and 
Place, and words like "that" can be used to mean "the one most 
recently mentioned", as In "Explain xhZl sentence." 

In addition to referring back to a particular object, we 
can refer back to a description In order to avoid repeating It. 
He can say: "Is there a small grey elephant from Zanzibar next 
to a big ane.?*'. Sometimes Instead of using "one" to avoid 
repetition, we simply omit part of a phrase or sentence. We can 
reply to "Would you like a corned-beef sandwich?" with "Bring me 
Im." or wc can respond to almost anything with "Why?" In chese 
examples, the second sentence Includes by Implication a Dar t of 
the first. 

These are not really discourse features, since they can 
appear just as well In a single sentence. In fact, there are 
some sentences which would be almost Impossible to express 
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without using one of these mechanisms, such as: "Find a block 
which Is bigger than anything which supports It." These 
mechanisms can be used to refer back to anything mentioned 
previously, whether In an earlier sentence of the speaker, one 
of the replies to him, or something occurring earlier In the 

same utterance. 



o 
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t . ? . l Pronouns 

First we will look at the use of pronouns to refer back to 
ibjects. Since our robot does not know any people other than 
the one conversing with It, it has no trouble with the pronouns 
"you" and "I" which always refer to the two objects :SHRDLU and 
:FRIEMD. A more general program would keep track of who was 
talking to the computer In order to find the referent of "I". 

When the NG program In the grammar finds a MG consisting of 
a pronoun. It calls the program which Is the definition of that 
pronoun. The definitions of "ft" and "they" use a special 
heuristic program called SHIT, which looks Into the discourse 
for all of the different things they might refer to, and assigns 
a plausibility value to each Interpretation. |f more than one 
is possible, they are carried along simultaneously through the 
rest of the sentence, and the ambiguity mechanism decides at the 
end which Is better. Including the last-resort effort of 

rlntlng out a message asking for clarification. If SMIT finds 
two different Interpretations, and one Is chosen because of a 
higher plausibility, the system types out a message to Inform us 

f the assumption made in choosing one interpretation, as In 
Sentence 3 of Section 1.3: 

BY "IT", I ASSUME YOU MEAN THE BLOCK 
WHICH IS TALLER THAN THE ONE I AM 
HOLDING. 

If a response from the user is needed, the request Is typed 
In the same format as the message used for other ambiguities, as 



P 



o 
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In sentence 2b of Section 1.3. In the case of sentence 3, It 



would be: 

I'M HOT SURE WHAT YOU MEAN BY "IT" IN 
THE PHRASE "PUT IT INTO THE BOX" 

DO YOU MEAN: 

1 - THE BLOCK WHICH IS TALLER THAU THE 

ONE I AM HOLDING 

2 - THE ONE I AH HOLDING ? 

A simple transforation Is used to switch "you" with "I", 
and make the corresponding verb changes, and the words are 
borrowed directly from the Input sentences. 

In our discussion of pronouns, we will use "It" as typical. 
In most cases, "they" (or "them") Is treated Identically except 
checking for agreement with plural rather than singular. The 
pronouns "he" and "she" never occur In our limited subject 
matter, but they would be treated exactly like "It", except that 
they would make an extra check to see that their referent Is In 
fact animate and of the right gender. 

The first thing checked by SHIT Is whether "It" has already 
appeared In the same sentence. We very rarely use the same 
pronoun to refer to two different objects In the same sentence, 
so It Is generally safe to adopt the same Interpretation we did 
the first time. If there were several possible Interpretations, 
the system Is careful not to match up one Interpretation from 
one occurrence of "It" with a different one from another 
occurrence In building an overall Interpretation of the 
sentence. 
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Similarly, If "It" was used In the previous sentence. It Is 
likely that If used again ft will refer to the same thing. In 
either of these cases, SHIT simply adopts the previous 
Interpretation. 

Next, a pronoun may be InsEde a complex syntactic 
construction such as "a block which Is bigger than anything 
which supports It." English uses the reflexive pronouns, like 
"Itself" to refer back to an object In the same sentence. 
However, If In go In* from the pronoun to the referent on the 
parsing tree. It is necessary to pass through another Mfi node, 
an ordinary pronoun like "It" Is used, since "Itself" would 
refer to the Intermediate NG. Notice that If we replaced "It" 
by "Itself" In our sentence. It would no longer refer to the 
block, but to "anything". 

SHIT looks for this case and other related ones. When such 
a situation exists, the program must work differently. 
Ordinarily, when we refer to "It" we have already finished 
finding the referent of the f(G being referred back to, and "It" 
can adopt this referent. In this case, we have a circle, where 
"It" Is part of the deflnltlonof the object It is referring to. 
The part of the program which does variable binding In relating 
objects and clauses is able to recognize this, and treat It 
correctly by using the same variable for "a block" and "It". 

The pronoun may also refer to an object In an embedded 
clause appearing earlier In the same clause, as In "Before you 
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pick up the red cube, clear li off." SMIT looks through the 
sentence for objects in such acceptable places co which "It" 
might refer. If It doesn't find them there. It begins to look 
at the previous sentence. The pronoun may refer to any object 
In the sentence, and the meaning will often determine which It 
Is (as In our example about the demonstrators In the Preface). 
We therefore cannot eliminate any of the possibilities on 
syntactic grounds, but can only give them different ratings of 
"plausibility". For example. In Section "..2.5 we discussed the 
Importance of a "focus" element In a clause. "It" Is more likely 
to refer to the previous focus than to other elements of the 
clause. Similarly, the subject Is a more likely candidate than 
an object, and both are more likely than a NG appearing embedded 

In a PREPG or a secondary clause. 

The system keeps a list of all of the objects referred to 
In the previous sentence, as well as the entire parsing tree. 
By using PROGRAMMAR'S functions for exploring a parsing tree, 
SMIT Is able to find the syntactic poslton of all the possible 
references and to assign each a plausibility, using a fairly 
arbitrary but hopefully useful set of values (for example we add 
200 for the focus element beyond what It would normally have for 
Its position as subject or object). In order tn keep the list 
of the objects In the last sentence, our semantic system has to 
do a certain amount of extra work. If we ask the question: "Is 
any block supported by three pyramids?", the PLANNER expression 
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produced Is: 

(THFIND ALL $?X1 (XI) 

(THGOALtHS $7X1 *BL0CK)) 

(THFIND 3 s?X2 (X2) 

<THG0AL(MS $?X2 (PYRAMID)) 
(THGOALUSUPPORT J?X2 $?X1)))) 

Once this Is evaluated. It returns a list of all the blocks 
satisfying the description, but no record of what pyramids 
supported them. If the next sentence asked "Are they tall?", we 
would have no objects for "they" to refer to. Special 
Instructions are Inserted Into our PLANNER descriptions which 
cause lists llkp this to be saved. The actual PLANNER 
expression produced would he: 

(THPUTPROP {QUOTE XI) 

(THFIND ALL $?X1 (XI) 

(THGOALUlS $?X1 'BLOCK)) 
(THPUTPROP (QUOTE X2) 

(THFIND 3 S?X2 (X2) 

(THGOALCIS $?X2 ^PYRAMID)) 
(THGOALUSUPPORT $?X2 $?X1))) 
(QUOTE BIND))) 
(QUOTE BIND)) 

This only occurs when the system Is handling discourse. 

Finally, "It" can be used In a phrase like "Do l_tl" to 
refer to the entire main event of the last sentence. This 
LASTEVENT Is saved, and' SHIT can use It to replace the entire 
meaning of "do It" with the description generated earlier for 
the event. 

When "that" Is used In a phrase like "do that". It Is 
handled In a similar way, but with an Interesting difference. 
If we have the sequence "Why did you pick up the ball?" "To 
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build a stack." "How did you do It?", the phrase "do It" refers 
to "Pick up a ball". But If we had asked "How did you do 
that?". It would refer to building a stack. The heuristic Is 
that "that" refers to the event most recently mentioned by 
anyone, while "It" refers to the event most recently mentioned 
by the speaker. 

In addition to remembering the participants and main event 
of the previous sentence, the system also remembers those In Its 
own responses so that It can use them when they are called for 
by pronouns. It also remembers the last time reference, 
(LASTIHE) so the word "then" can refer back to the time of the 

previous sentence. 

Special uses of "It" (as In "It Is raining.") are not 
handled, but could easily be added as further possibilities to 
the SMIT program. 
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iUJul Substitutes an^ Incompletes 

The next group of things the system needs to Interpret 
Involves the use of substitute nouns like "one", and Incomplete 
noun groups like "Buy me laa." Here we cannot look back for a 
particular object, but must look for a description. SMIT looks 
through a list of particular objects for Its meaning. SMONE 
(the program used for "one") looks back Into the Input sentence 
Instead, to recover the English description. "One" can be used 
to stand for part or all of that description. 

As with "It", "one" can refer back to something In a 
previous sentence, the previous reply, or earlier In the same 
sentence. Here though, there are no restrictions about where 
In the parsing tree the description can He. "One" depends more 
on surface characteristics than on structural differences. For 
example. It cannot refer back to a NG which Is a pronoun or uses 
a TPRON like "anything". Our program for "one" Is not as 
complex as the one for "It". It Is primarily based on the 
heuristic of "contrast". People often use "one" to contrast two 
characteristics of basically similar objects, for example "the 
big red block and the little fine.." The program must understand 
these contrasts to Interpret the description properly. We 
realize that "the Htle one" means "the little red block", not 
"the little big red block" or "the little block". In order to 
do this, our system has as part of Its semantic knowledge a list 
of contrasting adjectives. This Information Is used not only to 



Section U.3.2 - Page 578 



decide how much of the description Is to be borrowed by "one", 
but also to decide which description In a sentence "one" Is 
referring to. If we say "The green block supports the big 
pyramid but not the little one." It Is fairly clear that "one" 
refers to "pyramid". But If we say "The big block supports the 
green pyramid but not the little one.", then "one" might refer 
to "block". The only difference Is the change of adjectives -- 
"big" and "little" contrast, but "green" and "little" do not. 
Our program looks for such contrasts, and If It finds one. It 
assumes the most recent contrasting description Is the referent. 
If there Is no contrast between the phrase being analyzed and 
any NG In the same sentence, previous answer, or previous 
sentence. It then looks for the most recent NG which contains a 

noun. 

It Is Interesting to note that SHONE causes the system to 
parse some of Its own output. In order to use the fragment of a 
NG it finds, SMONE must know which elements It can use <such as 
noun, adjective, and classifier) and which It does not (such as 
number and determiner). For the noun groups In previous Inputs, 
the parsing Is available, but for the reply, only the actual 
words are available and It Is necessary to construct a simple 
parsing before understanding the meaning of "one". It does not 
call the entire system recursively to do this, but uses ■ 

slmpl If ied vers Ion. 

An Incomplete NG, containing only a number or quantifier Is 
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used In much the same way as "one". In fact. If we look at the 
series "Buy me three." "Buy me two." "Buy me one.", we see 
they are nearly Identical. We can take the view that an 
Incomplete NG actually has an Implied substitute noun of "one". 
This Is the way our program handles Incomplete noun groups. 
Currently the set of contrasts Is stored separately as 
special properties In the dictionary entries of the adjectives 
Involved. It would be better to combine this with the semantic 
marker system, or the actual system of PLANNER programs and 
concepts. 
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We have discussed several ways of using overall discourse 
context In understanding. We have so far experimented with only 
one of these — keeping track of what has been mentioned earlier 
In the discourse. This Is not the same as looking back In the 
previous sentence for pronoun references, as It may Involve 
objects several sentences back or occurring In separate 
sentences. If there are many blocks are on the table, we can 
have a conversation: "What Is In the box?" "A block and a 
pyramid." "What Is behind It?" "A red block and another box." 
"What color Is the box?" "Green." "Pick up the two blocks." 

The phrase "the two blocks" ts to be interpreted as a 
particular pair of blocks, but there may be others In the scene, 
and nowhere In the dialog were two blocks mentioned together. 
The system needs a way to keep track of when things were 
mentioned, in order to Interpret "the" as "the most recently 
mentioned" In cases like this. 

To do so, we use PLANNER'S facility for giving properties 
to assertions. When we mention a "green block", the semantic 
system builds a PLANNER description which Includes the 

expressions: 

(THGOAHMS $?X1 «8L0CK)> (THGOAL( 'COLOR $?X1 «GREEN>> 
After the sentence containing this phrase has been Interpreted, 
the system goes back to the PLANNER descriptions and marks all 
of the assertions which were used, by putting the current 
sentence number on their property lists. This Is also done for 
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the assertions used In generating the descriptions of objects In 
the answer. 

When the semantic programs find a definite NG like "the two 
red blocks", the second NG specialist (SHNG2) uses PLANNER to 
make a list of all of the objects which fit the description. If 
there are the right number for the NG, these are listed as the 
"reference" of the NG, and the Interpretation of that NG Is 
finished. If there are fewer than called for by the determiners 
and numbers, SMNG2 makes a note of the English phrase which was 
used to build the description, and returns a message to the 
parser that something has gone wrong. 

If Che parser manages to parse the sentence differently, 
all Is well. If not, the system assumes that the NG 
Interpretation was the reason for the fallurp, and the system 
uses the stored phrase to print out a message "I don't 
understand what you mean by..." 

However, If there are too many objects which match the 
description, SMNG2 tries to find out which were mentioned most 
recently. It does this by using PLANNER to recheck the 
description for the I terns It found, but this time using only 
those assertions mentioned In this or the previous sentence. 
This Is easily done by using PLANNER'S ability to put a "filter" 
on the assertions to be retrieved from the data base, allowing 
only those which are accepted by a LISP predicate (which In this 
case looks for a sentence number In the appropriate range on the 
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property list of the assertion. See <Sussman> for details.) If 
It finds the right number, these must be the reference of the 
NG. If It finds too few. It can reiterate the procedure, but 
using all of the assertions mentioned In the last two sentences. 
This backward progress continues until at some point either It 
finds the right number or the number found jumps from below the 
right number to above It. In this case a message of failure Is 
returned to the parser as before, but a marker Is set so that In 
case the sentence cannot be understood, the message returned Is 
"I don't know which... you mean", as In sentence 2 of Section 

1.3: 

I DOM'T UNDERSTAND WHICH PYRAMID YOU KEAN. 
Of course, a more sophisticated program Is needed for 
determining the referents of definite NGs. If the conversation 
Is about a book, the NG "the title" Is clearly acceptable and 
must refer to the title of that book, even though no title was 
previously mentioned. If there Is a complicated NG like "the 
biggest cat In the world" or "the thing I want to do most", the 
program must deduce from the form of the NG that there Is only 
one such object, even though It has not been previously 
mentioned, and that It Is being defined by this description. 
There Is a large area for expansion In dealing with these and 

similar problems. 

It seems likely that these and other similar aspects of 

language can be handled systematically, using the formalism of 
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procedures within a system like the one described In this paper. 
The semantic and deductive systems provide a framework within 
which we can formalize and combine a variety of aspects of 
meaning and knowledge to deal with the real problems of 
semantics. 
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k.U Generation of Responses 
In this section we will describe how our language- 
understanding system generates Its linguistic responses. This 
aspect was not emphasized as much in the research as the problem 
of understanding language, and w« have not developed a general 
theory of discourse generation. The programs produce behavior 
which seems reasonable for carrying on a dialog about the robot 
world. In particular we dealt with four different aspects of 
discourse production - patterned responses, appropriate answers 
to questions, generating descriptions of objects and events, and 
the production of fluent discourse. 

U.tt.l Patterned Responses 

The easiest way to get language behavior from a computer 
program Is to Include a set of fixed responses which are 
appropriate for the situations It will encounter. These 
responses can be made as complex and varied as we want, since 
they are created by the programmer, and the program only repeats 
them. Many language systems make heavy use of these fixed 
responses to achieve "good sounding" output. 

In general, these responses are not flexible enough for a 
true language system, but there are places where they are 
appropriate — for example, when the program wants to 
acknowledge that something has happened, or that It has reached 
acertainpoint in Its analys.s. Our system uses several types 
of patterned response. First there Is a set of simple responses 
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for specific situations. The system responds "OK." when a 
command Is carried out, "I UNDERSTAND." when a declarative 
sentence Is analyzed, "I DON'T UNDERSTAND." when a sentence 
cannot be analyzed, and "I CAN'T." when a command cannot be 
executed. 

A slightly more complex type of response Involves "filling 
In the blank" with a phrase borrowed from the Input. The 
simplest example In our system Is "SORRY, I DON'T KNOW THE WORD 
"...", PLEASE TYPE TWO SPACES." The offending word Is taken as 
It was Input and Inserted In the blank. The "two spaces" are to 
clear the Input buffer of characters typed after the message was 
sent. Two slightly more complex types of response Involve 
manipulating the determiners of the phrase which was Input. If 
the user types something like "the three green pyramids", and 
the system cannot figure out what he Is referring to. It t; 
"I DON'T KNOW WHICH THREE GREEN PYRAMIDS YOU MEAN." It ha: 
simply replaced "the" with "which" before filling the blank. 
The "I assume" mechanism does the opposite, replacing an 
Indefinite determiner with "the". If we talk about "some green 
pyramid" or "a green pyramid", then later refer to that pyramid 
as "It", the system can notify us of Its Interpretation of "It" 
by saying "BY 'IT' I ASSUME YOU MEAN THE GREEN PYRAMID." Here 
the system has removed the Indefinite determiner or Quantifier 
("»" or "some") and replaced It with "the", it uses knowledge 
about various kinds of determiners to fill In the pattern 
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reasonably In a number of cases (for example, possesslves are 

left alone, since they are already definite). It can also 

handle the use of "they" to refer to several objects. 

Our most complex "b»ank-f 11 1 In*" response Is the one which 

handles ambiguity. It uses parts of the Input sentence and 

pieces of the definitions of words stored In the dictionary. In 

the response to sentence 2U of the sample dialog (section 1.3): 

2i». how many things are on top of green 
cubes? 
the system typed: 

I'M NOT SURE WHAT YOU MEAN BY "OH TOP 
OF "IN THE PHRASE "ON TOP OF GREEN 
CUBES " . 

DO YOU MEAN! 

1 - DIRECTLY ON THE SURFACE 

2 - ANWHERE ON TOP OF T 

The two phrases "on top of" and "on top of green cubes" 
were taken directly from the Input, and the paraphrases 
"directly on the surface" and "anywhere on top of" were part of 
the dictionary definition provided for "on". One of the answer 
routines looks at the list of places where the Interpretations 
differ, and generates this request for clarification. The 
response must eliminate at least one possibility, end the 
program continues generating questions like this one until only 
one Interpretation remains. If the ambiguity Is caused by 
multiple Interpretations for a pronoun, the English text of the 

phrases originally used to refer to the objects are used In the 

response choices • 



Section k.b.2 - Page 387 



LJul Answering Questions 

In order to carry on a discourse, we need to know what types 
of responses people expect to different types of questions. In 
Section 2.3.3 we classified various types of questions 
syntactically, and we can use this classification to select 
appropriate answers. The system answers whenever possible with 
a complete phrase, providing as much Information as possible. 
No attempt Is made to produce full sentences, since In fact 
people rarely answer questions with sentences. The natural 
response to "Which block Is In the box?" Is "the red block", or 
"the red one", certainly not "The red block Is In the box." We 
have tried to analyze what sort of phrases provide the most 
Informative and natural responses. 

The first case Is the WH- question, and within that class, 
the NGQ<see section 2.3.2), Involving a question NOUN GROUP, 
such as "which block", "who", "how many pyramids", or "what". 
We can divide these Into three classes — "how many", "vague", 
and "specific". If asked "What Is In the box?", we can answer 
"a blue block and a pyramid" without being more specific at 
Identifying them. If Instead we are asked "Which block Is In 
the box?" we must use a more specific description like "the 
large blue cube which supports a pyramid." The program must 
generate English descriptions of particular objects, and It must 
be able to generate both definite and Indefinite descriptions. 
This part of the program will be described In the next section. 
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The use of Us results Is straightforward for NGQ questions. If 
the NG Is "what", we generate Indefinite descriptions of the 
object or objects. If It Is "which...", we generate a definite 
description. "Who" Is never a problem, since the system only 

knows of two people, "you", and There are also default 

responses, so that a question like "Which block supports the 
table?" can be answered "HONE OF THEM." 

HOWMANY questions are answered with the number of 
appropriate objects, followed by "OF THEM" to make the discourse 
smoother. For example, the response to dialog sentence 6, "How 
many blocks are not In the box?", Is "FOUR OF THEM." 

The next type of question Is the QADJ, such as "why", 
"when", "how", or "where". The only three which have been 
Implemented so far are "why" "when", and "how", but the others 
can be done In an analogous fashion. A "why" question Is 
answered using the system's memory of the subgoals It used In 
achieving Its goals In manipulating toy objects. If we can 
decide what event Is being referred to In the question, we can 
see what goal called It as a subgoal, and answer by describing 
the higher goal In English. If the event was Itself a top level 
goal. It must have been requested as a command, and the response 
Is "BECAUSE YOU ASKED ME TO.". 

We need a program which creates an English description of 
an event from Its PLANNER description like C#PUT0N :B3 t TABLE). 
It must generate phrases which refer to the objects Involved, 
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and combine them Into a clause of the proper type with the 
proper tense. This program Is also described In the next 
section. 

We can use the same event-descr Iber to answer "how" 
questions by describing all of the events which were subgoals 
used In achieving the event mentioned. We say "BY...", then 
list each event In an "Ing" form, as In: "BY PICKING UP A RED 
BLOCK AND PUTTING IT IN THE BOX." If the event was Itself a 
lowest-level goal, the system has no way of analyzing Its own 
programs for achieving that goal, and answers "I CAN'T ANALYZE 
HOW". 

"When" questions are answered similarly -- a time Is named 
by describing the top-level goal which was being carried out at 
the time, saying "WHILE I WAS..." and using the "Ing" form to 
describe the event. This Is Inappropriate If the question 
refers directly to the top-level goal (We can't answer "When did 
you build the stack?" with "WHILE I WAS BUILDING THE STACK."), 
and In that case we say "BEFORE..." and name the top-level goal 
Immediately following In time. If the goal mentioned was just 
done, the system replies "JUST NOW.". In addition to the normal 
responses, the system has a set of fixed responses such as 
"NEVER", and "I CAN'T EXPLAIN A NON-EXISTENT EVENT." to answer 
questions which demand them. 

Finally we come to YES-NO questions which, paradoxically, 
are the most complicated. It seems that a one word answer Is 
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called for, but this Is often Impossible and rarely the best way 
to respond. If we ask "Does the block support three o/ramlds?", 
and In fact It supports four, what Is the correct answer? The 
system could ask for clarification of the Implicit ambiguity 
between "at least three" and "exactly three", then answer "yes" 
or "no". But It Is more efficient and helpful to answer "FOUR 
OF THEM", leaving the speaker to Interpret his own question. 
If there were only two pyramids, the system could respond "HO", 
but It would be more Informative to say "NO, ONLY TWO OF THEM". 
In any of these cases, we might be even more complete, and say 
something like "FOUR OF THEM: A BLUE ONE, TWO RED ONES, AND A 

LARGE GREEN ONE." 

There are three parts we can Include In the answer -- "yes" 
or "no", a number, and a description of the objects. Remember 
that whenever possible a YES-NO question Is treated like a MO 
question by looking for a focus element (an Indefinite NG In the 
question). A question like "Does the pyramid support a block?" 
Is treated logically like the question "Which block does the 
pyramid support?" , or "How many blocks does the pyramid 
support?" All three send a THFIHO ALL request to PLANNER, asking 
for all of the objects fitting the description: 

(THPROG (X) 

(THGOAL (#IS $?X 'BLOCK)) 
(THGOAL (^SUPPORT :B5 4?X>)) 

where :B5 Is the system's Internal name for the pyramid being 

referred to by "the pyramid" (this would be determined In 
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separate calls to PLANNER). In the case of the HOWMANY Question 
we answer with the number of objects found. For the "which" 
question, we name the objects. In the case of a YES-NO 
question, we answer with all three kinds of Information, saying 
"YES, TWO OF THEM: A LARGE RED ONE AND THE GREEN ONE." 

The first element Is "YES" If the answer Is clearly yes 
(for example If the number Is matched exactly, or the number In 
the original question was Indefinite as In this example), "NO" 
if It Is clearly no (for example If there are fewer than 
requested, none at all, or the request was of a form 
"exactly..." "at least..." "more than..." etc. and was not met), 
and Is omitted If there Is a question about Its Interpretation 
(as described above). 

The second element, the number. Is omitted If the number 
found matches the request (For example, "Are there three 
blocks?" Is not answered redundantly, "YES, THREE OF THEM: A 
GREEN ONE AND TWO LARGE RED ONES."). The phrase "OF THEM" 
following the number Is changed to "THINGS" If the focus 
contains a TPRON like "anything", or "something". If the number 
found U less than that In the focus. It is preceded by 
"ONLY... 1 , so the answer comes out "MO, ONLY TWO OF THEM:...") 

At the end of a response, we put the description of the 
objects found, unless the request used a special number format 
such as "exactly...", "at least..." etc. In which case the 
system assumes the number Is more Important than the specific 
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objects. We use the object-naming program In Its Indefinite 
mode. If the focus originally appeared as the object of a 
preposition, we repeat that preposition before the description 
to clarify the answer. Thus, "Is the pyramid on a block?" Is 
answered "YES, ON A LARGE GREEN ONE." The unknown agent of a 
passive like "Is It supported?" Is Implicitly the object of 
"by", so the answer Is "YES, BY THE TABLE." If a YES-NO 
question contains no possible focus since all of Its NGs are 



DEFInlte, as In "Does the table support the box?", the sys 



answers simply "YES" or "NO". 



tem 
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kJul Nfl^lng Objects and Events 

The previous section covers all of the different types of 
questions the system can handle, and the types of phrases It 
uses In response. We have not yet explained how It names an 
object or describes an event. This Is done with a set of 
PLANNER and LISP functions which examine the data base and find 
relevant Information about objects. These programs take 
advantage of the fact that the subject matter Is limited, in 
general, the way an object Is named Is highly dependent on what 
the person being spoken to Is Interested In and what he already 
knows. This has not been dealt with yet. Certain features of 
objects, such as their color and size, are assumed to be the 
best way to describe then In all contexts. 

First we need to know how the object Is basically 
classified. In the BLOCKS world, the concept 'IS represents 
this, as In (MS :HAND 'HAND), (#IS iBl 'BLOCK), and (#15 'BLUE 
'COLOR). The naming program for objects first checks for the 
unique objects In Its world, "I", "you", "the table", "the box", 
and "the hand". If the object Is one of these, these names are 
used. Next It checks to see If It Is a color or shape. In which 
case the English name Is simply the concept name without the 
"#". The question "What shape Is the pyramid?" Is answered 
"POINTED." since It has the shape 'POINTED. If the the object 
Is not one of these and Is not a 'BLOCK, 'BALL, or a 'PYRAMID 
the program gives up. If It Is one of those three, the correct 
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noun Is used (Including a special check of dimensions Co see If 
a »BLOCK Is a "cube"), and a description Is built of Its color 
and size. At each stage of building, the description Is checked 
to see If It refers uniquely to the object being described. If 
so, the determiner "the 1 ' Is put on, and the description used 
without further addition. If there is only one ball In the 
scene. It will always be referred to as "the ball . 

If the description Includes color and size, but still fits 
more than the desired object, the outcome depends on whether a 
specific description or an nonspecific one Is called for. If It 
Is nonspecific, the program puts the Indefinite pronoun "a" or 
"an" on the beginning and produces something like "A LARGE GREEN 
CUBE". If It Is specific, more Information Is needed. If the 
object supports anything, the program adds the phrase "WHICH 
SUPPORTS..." then Includes the English descriptions (Indefinite) 
of all the objects It supports. If the object supports nothing, 
the program adds "WHICH IS TO THE RIGHT OF..." and names all of 
the objects to the left of the desired one. This still may not 
characterize the object uniquely In some situations, but the 
system assumes that It does. If at any point In the dialog, an 
object Is given a proper name, it Is referred to using only the 
noun and the phrase "NAMED...", as In "THE BLOCK NAMED 
SUPERBlOCK." 

Naming events Is relatively straightforward. With each 
event type (such as »PUTON or *STACKUP) we associate a small 
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program which generates an English name for the event and 
combines It properly with the names of the objects Involved. 
For example, the definition for »PUT0N Is: 

(APPEND (VBFIX (QUOTE PUT)) OBJl (QUOTE (ON)) 0BJ2) 
VBFIX Is a program which puts the verb Into the right form for 
the kind of clause needed to answer the question. (for example, 
-Ing for answering "how", or Infinitive for answering "why"). 
It takes Into account the changes In spelling Involved In adding 
endings. OBJl and 0BJ2 are bound by the system to the English 
names of the objects Involved In the event, using the object- 
naming program described above. APPEND Is the LISP function 
which puts together the four Ingredients end to end. We 
therefore get descriptions like "BY PUTTING A LARGE RED CUBE ON 
THE TABLE". There Is a special check for the order of particles 
and objects, so that we output "TO PICK UP THE SMALL BLUE 
PYRAMID.", but, "TO PICK IT UP" rather than, "TO PICK UP IT". 
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fa.b.t. Generating. Discourse 

The previous sections described a generating; capability 
which can produce reasonable English answers to different types 
of questions, but used by themselves, the features described 
would produce awkward and stilted responses which would at times 
be Incomprehensible. Even though we have mentioned some 
discourse-like patterns < 1 1 ke "...OF THEM" following a number), 
we have not yet discussed the real problems of discourse. The 
system uses three different discourse devices In producing Its 
answers. These are much more limited than the range of 
discourse features It can understand, but they are sufficient to 

produce fluent dialog. 

The first problem Involves lists of objects. Our Initial 
way of naming more than one object Is to simply string the 
descriptions together with commas and "AND". We might end up 
with an answer like "YES, FOUR OF THEM: A LARGE BLUE BLOCK, A 
SMALL RED CUBE, A SMALL RED CUBE, AMD A SMALL RED CUBE." To 
avoid this redundancy, the object-namer looks for Identical 
descriptions and combines them with the appropriate number to 
get "A LARGE BLUE BLOCK AND THREE SMALL RED CUBES." (Note that 
It also must change the noun to plural). 

The next problem Is the use of substitute nouns. We would 
like to respond to "Is there a red cube which supports a 
pyramid?" by "YES, A LARGE ONE." Instead of "YES, A LARGE RED 
CUBE." By comparing the English descriptions of the objects with 
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the wording of the focus In the Input sentence, we can omit 
those nouns and adjectives they share and replace them by "ONE". 
The third problem Is more serious, as Ignoring It can lead 
to Incomprehensible responses. Consider the answer to question 
32 in the dialog ("How did you do It?"). If we did not use the 
pronoun "It" or the determiner "that", the response would be: 

SI PTTiS? S"S?I RFD BL0CK 9H T HE TABLE, THEN LETTING GO 
?!«Sr L i R S E ED BL0CK ' ™EN PUTTING A LARGE GREEN CUBE ON A 
LARGE RED BLOCK, THEN LETTING GO OF A LARGE GREEN CUBE 
THEN PUTTING THE RED CUBE ON A LARGE GREEN CUBE, THEN 
LETTING GO OF THE RED CUBE. 

How many different blocks and cubes are Involved? In describing 

events, we must have some way to Indicate that we are referring 

to the same object more than once. We can do this using "It" 

and "that", and at the same time can use these words to Improve 

the smoothness of the discourse In other ways. The system has 

heuristics which lead It to use "It" to refer to an object In an 

event It Is describing whenever: 1. the same object was called 

"It" In the question. 2. the object was called "It" In the 

question preceding the current one, and "It" was not used In the 

current one. 3. the object has already been named In the 

current answer, and was the first object In the answer k. no 

objects have yet been named In the current answer, and the 

object was the only one named In the previous answer. 

To refer to an object already named In the current answer, 

other than the first, the program applies the determiner "that" 

to the appropriate noun, to get a phrase like "BY PUTTING A 
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GREEN BLOCK ON A RED CUBE THEN PUTTING THAT CUBE IN THE BOX." 
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JUl^I Future Development 

The generation of language Is a complex subject, wide open 
to future development. Our current system [ a just a beginning, 
and has some major deficiencies. First, we would like to 
describe an object by using facts which are relevant to the 
context. In our simple world, we have declared by flat that 
color, sl«, and support relationships are the Important facts 
about an object. We could just have well have used location to 
get answers like "the block nearest to the back of the table". 
With a wider range of subjects, we would need much more 
sophisticated heuristics for deciding what features of an object 
will serve best to Identify It to the hearer. 

Second, we do not have a way to turn an arbitrary PLANNER 
expression Into English. We can handle only specific objects 
and simple events. There are a number of applications for a 
more powerful English generator. For example. In case of 
ambiguity, we shouldn't have to Include special paraphrases In 
the definition. The system should be able to look at the two 
PLANNER descriptions and describe the difference directly In 
Engl Ish. 

The system should be able to tell us more about Itself and 
how It does things. If we ask a question like "How do you build 
stacks?", it should be able to look at Its own programs and 
convert them to an English description like "First I find a 
space, then I choose blocks, then I put one of the blocks on 
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that space, then..." PLANNER'S structure of goals and subsoals 
Is Ideal as a subject for this kind of description, and a great 
deal could be done along this line. In a more speculative vein, 
the development of discourse generators which could convert an 
Internal logical format Into natural language might lead to 
computer essay writers, or translators which could understand 
the material they were working with. 
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h.5 Comparison with Other Semantic Systems 
friSil Introduction 

This section compares our semantic system with two other 
models for the semantics of natural language. Each has served 
as the basis for computer programs which "understand" language, 
and we will consider the usefulness of the models for this 
purpose as well as their value as theoretical models of a 
natural process. 

The three models can be labelled "categorization", 
"association", and "procedure". These do not represent a cross 
section of semantic theories, but represent one particular type 
of theory. They assume that It Is meaningful to postulate a 
conceputal organization of human knowledge, related by semantics 
to the linguistic forms used In expressing thoughts. This sets 
them off from traditional approaches which avoided postulating 
mental Istlc structures and dealt Instead with extra-mental 
representations such as logical truth conditions or stimulus- 
response relationships. 

These three models are oriented towards viewing language as 
a human activity rather than an abstract calculus of symbols. 
They study the process In terms of human models, and take Into 
account the production and Intepretatlon of language. One 
possible reason why the current syntactic theories have been 
weak In develo P ln E semantic theories Is their Insistence on a 
"neutral" characterization of the competence of a language user. 
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without regard to the process carried out In an Intelligent 
speaker or hearer. Since semantics Involves the Interaction 
between the structures of the language and the knowledge and 
Intelligence of the language user. It cannot be understood 
without trying to deal with this Intelligence directly. 
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lUl^l Categorlratlpp 

The Categorization model of semantics was develooed In Its 
best known form by Katz and Fodor <Katz>, and has been a part of 
many computer systems for understanding natural language. 

The basic principle Is a structure of categories, called 
"semantic markers", dividing the conceputal world much as the 
Dewey decimal classification subdivides the books In a library. 
The usual sense of the word "bachelor" has the semantic markers 
"male", "human", "animate", "physical object", etc. and the 
final distinguishing characteristic ("never having been 
married") Is Its "semantic dlst Ingulsher". 

In choosing between different senses of a word In a 
particular sentence, these markers are combined according to 
"projection rules". For example, the word "colorful" would be 
Interpreted In one sense In "colorful cube", another In 
"colorful party", while the rules would Indicate that "a 
colorful ball" has two possible readings. 

Further Information can be gleaned from the logical 
relations between the markers such as the fact that "male 
uncle" Is redundant, while "female uncle" Is anomalous, 

Fodor and Katz did not attempt to explain the process of 
producing and understanding language In terms of these markers, 
preferring to see them as abstract neutral relationships 
underlying the speaker's competence. They did not deal In any 
systematic way with those aspects of meaning which cannot be 
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dealt with through this type of categorization. 

Nevertheless, categorization has been used In many computer 
programs for understanding natural language, to help choose the 
right meaning for potentially ambiguous words. Schank <Schank 
1969, 1970> has extended the application of this theory, using 
semantic relationships to parse sentences conceputal ly. 
Associated with each sense of a word Is a conceptualization, 
specifying the semantic relationships of that word with other 
words In the same structure. For example, one meaning of the 
word "hit" would be an action of physical striking, whose 
subject Is a "person", whose object Is a "physical object", and 
which has a possible Instrument of the category "weapon". The 
sentence "I hit the boy with a stick." would be parsed by 
noticing words In categories which could fill the roles, and by 
setting up an appropriate structure. It could also account for 
the Interpretation In which "hit" Involves striking with a fist, 
and "with" represents possession of a "physical object" by a 
"person", but this would be found only on "prompting" (see 

<Schank 1970>, p. 26). 

The underlying belief Is that humans make much use of this 
sort of categorization In understanding sentences, rather than 
doing a complete syntactic parsing. The sentence: 

"The window the ball the boy threw hit broke." 
Is understood more easily than: 

"The man the woman the girl knew liked died." 
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Fodor and Garrett <Fodor I967> studied sentences like these, and 
found that they are more easily understood when the categories 
associated with the verbs can Indicate the conceptual structure. 
For sentence fragments and ungrammatlcal utterances, this 
abll Ity seems vital . 
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fa. 5.5 Association 

The second model has Its roots more in psychology, and the 
presence of "associations" between words In a person's mind. It 
postulates a sprawling collection of words and concepts, 
connected to each other by simple links. If "goose" Is 
connected to "quill" and "quill" to "pen" and "pen" to "Ink", 
there Is a path of length three from "eoose" to "Ink". These 
links would be present In Information such as "A quill Is a 
goose feather", "Pens can be made of qui lis", etc. The 
justification for this model Is that the course of a path 
describes the relationship between the two nodes, and that Its 
length Is a measure of their relatedness. The use of 
association as a model for computer language understanding has 
been most Influenced by the work of Qullllan <Qullllan 196k, 
1969>. Information is coded Into the network of concepts using 
several types of links (for example, the class-subclass 
relationship used In the categorization model). In 
understanding a sentence, a search is Initiated through the 
network from each of the content words of the sentence, to find 
the shortest paths linking them. The system uses the 
Information along that path to decide what the sentence is 

about. 

It Is Important to understand why I call this the 
"association" model Instead of the "network" model. The word 
"network" has been used to refer to every conceivable variety of 
data structure. The semantic markers In the categorization 
model form a network , Schank <Schank 1969> refers Jo the 
output of his parser as a "language-free conceptual network , 
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em 

a "network" Is not much more" Informal! V*'tn*n%ayl!t/thaTll Tl l* 
represented by bits In a computer memory. What mus? be stated 
Is the way the network Is used. »««o 

A central committment of the association model Is that 
There Is a significance to tracing along the links from node to 
node Ignoring their content. Once a path Is found, all sorts of 
logical operations may be used to determine Its significance and 
make use of Its Information, but In the propagation, a minimum 
of calculation Is done at each node. 

It Is difficult to formalize a "minimum" of calculation, 
but It Is Important to have some understanding of Its 
Implications. Any computation whatever, can be expressed as a 
network by drawing a flow chart, with the blocks of computation 
as nodes, and the transfers of control as links. The 
computation then traces a path through the net. It might seem 
that there Is something Inherently different between a program 
following a single path through a flowchart, and a signal 
propagating In all directions through a net. However the 
difference disappears If we allow some sort of parallel 
processing (for example the pseudo-simultaneous evaluation of 
several paths, as found In many simulation languages, and some 
theorem-provers such as new versions of PLANNER <HewItt 1970>). 
This Is not the place to debate the merits of parallel vs. 
serial processing. The Important thing Is to realize that once 
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networks carry out computations at each node, they move away 
from the association model, towards a model of semantics as a 

program. 

Some types of natural language understanding do appear to 

Involve simple associations. On hearing the words "fortune" and 
"almond", a person will (If he has eaten In Chinese restaurants) 
think of the word "cookie". It Is hard to describe logical 
connections which lead to such a quick reaction, and much more 
appealing to picture a short association path between "cookie" 
and each of the original words. The model can also be used to 
explain the choice of a single meaning for a potentially 
ambiguous word In a sentence. If a word Is connected by a link 
to each of the concepts It might describe, the shortest path to 
the other words In the sentence should be through the relevant 
meaning- 
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!U£J| Procedure 

We can call the model of semantics used In our system the 
"procedure model". The primary organization of knowledge Is In 
a deductive program with the power to combine Information about 
the parsing of the sentence, the dictionary meanings of Its 
words, and non-1 Ingulstlc facts about the subject being 
discussed. Any relevant bit of knowledge can Itself be In the 
form of a program or procedure to be activated at an appropriate 
time In the process of understanding. The program operates on a 
sentence to produce a representation of its meaning In some 
Internal language. In our case PLANNER. This language allows the 
expression of a wide variety of the aspects of language — 
logical connectives and quantifiers, time references (provided 
by verb tenses and modifiers), different sorts of object- 
modifier relatonshlps, types of object reference (e.g. the 
difference between "the dog" and "a dog"), etc. 

In analyzing a sentence, the program can use Information 
about previous sentences In the discourse and about the subject 
being discussed. This allows It to deal with features of 
language such as pronoun reference, substitute nouns, the effect 
of discourse on specific referents, and the disambiguation of 
meaning through knowledge of non- 1 Ingulst lc facts (like Bar- 
Hlllel's classic example of the "box In the pen" <Bar-HI I lel>) . 

Other programs, such as <Woods 1969> also use the procedure 
model. These programs use a complete syntactic parsing of the 
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Input to provide a framework from which the program can decide 
what aspects of meaning to deduce. The majority of the 
language-comprehending programs have used the procedure model In 
a simplified form, performing only a few elementary types of 
deduction In analysis, and having an Internal language tailored 
to a specific application. 
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** « S» 5 Evaluating lh& Models 

There Is no single set of criteria to judge the success of 
a model of semantics or a computer program for "comprehending 
language". Success Is relative to the goals of the model, and 
the aspects It wishes to describe. Four criteria seem to be of 
Importance both In computer language comprehension and In 
developing a theory of semantics. These are: ability to 
combine syntax and meaning ; efficiency; ability to explain 
human performance; and the ability to understand language In 
context. These will be discussed separately. 
A. Integrating Syntax 

There are many facets to the meaning of an utterance In a 
natural language, and no model sheds equal light on all of them. 
In fact, two of the three models are limited to one part of the 
meaning — the basic semantic relationships between the words 
used In the sentence. 

The Fodor-Katz version of the categorization model does not 
attempt to deal with the part of meaning expressed by the 
semantic distinguishes, analyzing only those aspects which can 
be modelled by the markers. It does not work with other aspects 
of meaning such as tense, mood, and reference to objects. 
Schank's version attempts to model the way people understand a 
sentence, describing an actual parsing process. However the 
conceptual parsing does not actually find the "meaning". One 
argument for the model Is human ability to understand utterances 
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[n a foreign language without a detailed knowledge of the 
grammar. Only the meanings of word* are needed to find the 
language-free conceptual connections (see <Schank 1970>, P.O. 
This seems a good parallel to the type of understanding done by 
the categorization model. Anyone who has tried to get along In 
an unfamiliar foreign language will be familiar with the 
following experience. A friend says something which contains 
the foreign language equivalents of the words "like", "see", and 
"film". The foreign visitor knows the words and their "word- 
concept couplings", but is totally at a loss In trying to 
respond, since the sentence may have been any one of a vast set. 

Including: 

"I like seeing films." "Have you ;«; any f lln* you J ( j k jJ'" "' 
see you like films." "Would you like to see a film? I saw a 
film I liked." "Did you like seeing the film? etc. 

Without the additional meaning provided by syntax, It Is 

Impossible to understand the content of the sentence. If the 

visitor responds "We are talking about a person who sees a film 

and likes the film", his foreign friend can rightfully reply 

"Oh, you didn't nndprstand." This problem applies equally to the 

association model. Finding the Intersection of signals from the 

nodes "see", "film", and "like" might produce the right 

conceptual relationships, but none of the additional 

Information. Neither of these two models has been the basis for 

an actual question-answering system, since they do not deal with 

the ways In which syntax conveys meaning, and therefore Ignore 
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those aspects of semantics In which syntax plays a targe role. 
The procedure model pays more attention to a complete 
understanding of a sentence, trying to Interweave syntax, 
semantics, and deduction In order to actually answer a question, 
use a piece of new information, or follow a command. 
B. Efficiency 

It would seem reasonable to look for capabilities of the 
other two models not possible for the procedure model. In one 
sense, this quest Is a joke. Since a procedure system has 
information In the form of programs, those programs can Include 
simulations of any other model. The significant question Is not 
what Is possible theoretically, but what Is reasonable to do. 

A program could play a simple game like NIM by using 
standard strategies of minima* and look-ahead. If It could win. 
It would provide a successful model of NIM playing. However It 
makes no use of the simple winning strategy, and therefore Is a 
bad model for the specific game. Similarly, the procedure model 
approaches semantics In a general way, saying that every part of 
semantics Involves powers of deduction and the ability to 
combine Information of a variety of types. If in fact, major 
parts of language comprehension can be explained by more 
elementary approaches, the general procedure model Is not a good 
description for those areas. 

The Issue at stake Is more than computer efficiency. Sin 
we are modelling a natural process, the criterion of "Occam's 
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razor" applies just as In any other science. The most 
satisfactory explanation Is the least complex one which can 

account for the facts. 

The area of computational complexity Is barely charted, and 
it Is nearly Impossible to determine that some computation Is 
Inherently "more complex" or "more difficult" than another. It 
is especially dangerous to characterize high-level processes 
like deduction, since a computation which takes Impossibly long 
using one scheme may be trivial for another. However, there Is 
an Intuitive sense In which efficiency can be judged. A 
procedure system could handle the "fortune -- almond" example by 
systematically looking through the things It knows about 
fortunes and almonds, and using some sort of analogy program to 
test for relationships. This seems clearly more complex than 
the presence of a simple association link. Those who advocate 
the assoclatlonal model feel that there will be many such cases 
In which the deductive process needed to find the path would be 
Impossibly torturous and lengthy. 

There Is also a complexity of syntactic parsing. The 
semantic connections might give clues to the underlying 
structure which would change the parsing task Into simply 
checking the plausibility of the relations, and cleaning up the 
details. This Is the approach taken by both Schank and 
Quill Ian. The example Involving the boy, ball, and window 
involves a complex syntactic structure which could not be 
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handled by most of the parsers which have been written for 
computers, yet a simple set of semantic criteria seem to analyze 
It directly without any complex syntax rules. 
C. Modelling Human 8ehavlor 

It is difficult to find psychological experiments which 
could decide between one model and another, since the underlying 
conceptual structure Is too complex to Isolate a "single 
Interaction". However, some examples seem to suggest the 
validity of various models. In speech communication, people 
understand sentence fragments, scattered words, and blurred 
Phrases which require filling In much of the meaning. In a 
model requiring a complete parsing, this would add a great deal 
of complexity, since the parser would have to know about the 
different types of fragments as well as the grammatical 
sentences. A semantic relation model suggests that the syntax 
Is only used at the end of the process, to check on the 
conceptual message. If the syntax Is lacking, the final check 
is gone, but the basic meaning Is still discoverable. 

Special types of language use, like poetry, puns, and jokes 
seem to Involve simple associations! links. Often the punch 
Hne of the joke comes from recognizing the Inapproprlateness of 
the link which was made, while the poem conveys meaning by 
showing that a link Is not as Irrelevant as It outwardly seems, 
but hints at deeper connections. 
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0. Context 

One of the most Important facts about language 
comprehension Is that sentences do not appear In logical 
Isolation, but are always part of a context, both of other 
utterances and of the situation In which they are uttered. Some 
semantlcists try to avoid this problem. Bay In. It goes beyond 
the proper realm of semantics. Katz and Fodor believe that 
"...a semantic theory cannot be expected to account for the way 
setting determines how an utterance Is understood." «Katz> p. 
.86) However, If semantics Is to be a study of the way language 
and meaning are actually related, we cannot tgnort the facts. 
One of the main strengths of the procedure model Is Its 
ability to Include all sorts of knowledge In making deductions 
at any stage of semantic analysis. The program can call on the 
contextual knowledge just as easily as the dictionary 
definitions or syntax. Within the framework of the basic 
procedure model, there can be a detailed model of those parts of 
the context which are needed for understanding (for example a 
memory of the objects which have been mentioned, so pronouns can 
refer back to them) . The examples below show some of the 
problems Involved In other models when context enters Into 

understand! ng. 

Schank uses the sentence "I hit the man with a stick." to 

Illustrate conceputal parsing. (<Schank 1970> p. 26) Since 
"hit" takes an Instrumental of the type "weapon", the conceptual 
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parser first assumes that the phrase "with a stick" has this 
meaning. Let us look at two possible contexts for the 
sentence: 

1. Three men attacked me. I hit the man with a stick. 

2. A man attacked me. I hit the man with a stick. 

The phrase "with a stick" Is Interpreted differently In the 
two cases, but this cannot be because of differing conceptual 
relations. The relevant Information Is that a phrase like "the 
■nan" will only be used when It Is clear which particular man Is 
meant, while "the man with a sttck" will be used only In trying 
to distinguish one particular man from others. 

Another example used (<Schank 1970> p. 11) | s the 
disambiguation of the word "fly" depending on whether Us 
subject Is a "pilot" or not. If you know that Ed's father Is a 
Pilot, the sentence "Ed's father flew to Chicago." should be 
Interpreted In the sense of "operating a plane". But there are 
no categorization clues In the words "Ed" or "father", for the 
conceptual parser. 

It seems that within the association model there should be 
some way to make use of this Information. If there were a node 
linked to "Ed", "father, and "pilot", then the network search 
Involving "Ed", "father", and "n y " would go through It, and the 
relations could be determined from the path. This approach Is 
deceptive, as adding this type of knowledge creates a world of 
false short paths. This Is a problem Inherent to the approach. 
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The earner example of "goose" "quill" W "Ink" could be 
extended one place further from "Ink" to "spot". It Is 
extremely unlikely that a sentence containing "spot" and "goose 
would actually Involve this connection. Yet It might be a much 
shorter path than the one actually representing the connection 
In "I spotted a goose." As the amount of knowledge grows, there 
will be a rapidly expanding number of false paths, made up of 
links which are individually very close, but which hear no 
logical relation to each other. Since the association network 
does not check the logical relations of the links until after 
the path Is found, there Is no Irradiate way to sort these out. 

By Including specific knowledge, this problem Is 
exacerbated, since each node will have a lar*e number of links, 
of widely differing logical types. 

There are various ways to sneak deduction into the 
association model, and for each simple example, It is possible 
to design a trick which cuts out the Irrelevant links 
(Qullllan's distinction between property and superset links 
<Qul11lan 1969> Is an example). As the amount of Information In 
the net Increases, It needs more ad-hoc deductive schemes, and 
In order to handle language generally, the association net will 
become a full-fledged parallel processor using procedures to 

find semantic relations. 

In addition to swamping the system with Implausible links, 
the association model can produce very plausible Incorrect 
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links. If the sentence: 

"He hated the landlord so much he decided to move Into the 
house on Harvard St." 

were given to a system like TLC, It would be hard to restrain It 
from saying "We are Calkin* about a landlord who owns a house. 
The house Is located on Harvard St...." The path from "landlord" 
to "house" will be as strong as the path from "lawyer" to 
"client" In the standard assoclatlonal example. But In this 
case, deduction is needed to realize that the association Is 
wrong. A person would move out of the house of a landlord he 
hated, not Into It. 

These examples point out a serious defect of non-deductive 
models. Earler sections discussed the existence of areas of 
language comprehension which could not be handled without syntax 
and deduction. These examples Indicate that deduction Is 
necessary even for the tasks for which the other models are 
designed — finding the semantic and conceptual relations 
between the words In the sentence. 
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ii.5.6 roncluslons 

Although the procedure mode! seems to account for much more 

of language behavior than either of the others, there are some 

parts of language understanding where they are especially 

applicable. There Is no one aspect of understanding which can 

succeed without deduction, but the simpler models can be of help 

In making appropriate deductions In many sentences. 

In trying to understand language at a deep level, a system 

cannot hope to simply throw together the advantages of these 

different models, but needs a way to Integrate them usefully. 

This Is the primary advantage of the procedure point of view. 

It Is flexible enough to make use of the other models In a 

systematic way In an Integrated system. If semantic criteria 

can simplify parsing, a partial semantic analysis can be 

Included In the procedure before syntactic analysis. This could 

Involve category matching, or even a controlled search through 

some sort of association net. These might be used as well In 

choosing between meanings of a word, or In finding Information 

applicable to a deduction about the subject matter In generating 

a response. 

The degree to which these special sub-models could be used 
would depend on the particular application and the subject being 

di scussed. 

A program to read poetry or retrieve documents on the basis 

of vague descriptions of subject matter may need a strong 
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association component. One written to answer questions about 
airline schedules, or understand questions and commands to a 
robot will be more procedurally oriented. In general, a use 
Involving detailed knowledge of any specific subject can rely on 
deduction, while an application needing superficial knowledge of 
a wide range of subjects can benefit from association, and will 
be correspondingly weak In Its ability to give specific 
responses. 

Specific models such as association and categorization are 
subparts of a model of language understanding, while the 
procedure model Is fundamentally an approach to Integrating all 
of the different sub-models Into a total semantic theory. It 
represents a point of view that no part of the process can be 
Isolated from the basic computational power or "Intelligence" of 
the language-understander (whether human or computer). 
Understanding of language, as well as other types of human 
behavior, depends on this ability to see the operation of 
Intelligence at every level of processing. 
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Chapter 5. Conclusions 

5.1 Teaching, Telling and Learning 
One of the most Important requirements of a natural 
language understanding system Is generality. It should not be 
based on special tricks or shortcuts which limit It to one 
particular subject or a small subset of grammar, but should be 
expandable to really handle the full diversity of language. In 
each of the three preceding chapters we have pointed out that 
many approaches to language understanding are quite limited, and 

have tried to Illustrate the progression within each sub-area 

towards more general approaches. 
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Sil.l Types flf KnowledgP 

In evaluating the flexibility of a system, we must consider 
the four different levels of knowledge It contains. 

First, there Is the "hard core" which cannot really be 
changed without remaking the entire system. This Is Its "Innate 
capacity" -- the embodiment of Its theory of language. At this 
level we must deal with such questions as whether we should use 
a top-down transformational parser, a semantic net, or some 
other approach to the basic analysis of a sentence, or whether 
we should have special tables of Information or a general 
notation (such as the predicate calculus) for representing 
Information. 

The second level of knowledge Is the complex knowledge 
about the language and the subject being discussed. This would 
Include such things as the grammar of a language, or the 
conceptual categories Into which the speaker divides his model 
of the world. If we think about the human speaker, this Is a 
type of knowledge which Is obviously not Innate (since the 
grammar would be different for English and Chinese and the set 
of concepts used would differ for talking about toy blocks and 
talking about love stories). However It Is not something which 
he learns by being told, or which he changes very easily. Over 
a period of years, he builds up a store of very complex. 
Interrelated knowledge, which serves as a framework for more 
specific Information. 
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The third level Is our storehouse of knowledge about the 
details of our language and our world. It Includes the meanings 
of words, and most of what we called "complex knowledge" In 
section 5.1.3. This would Include such things as "A house built 
on sand cannot stand.", "If you want to pick up a block, first 
clear off Its too.", or "Sunspots cause strange weather.". In 
human terms, this Is the knowledge we are continually learning 
all of our lives, and forms the bulk of what we are taught In 

school . 

Finally, the fourth level Is the set of specific facts 
which are relevant to a discussion. This Includes facts such as 
"Flight 342 leaves Boston at noon.", "The red block Is 3 Inches 
tall.", or "A banana Is hanging above the chair.". This Is the 
easiest type to learn, since It does not demand forming any new 
Interrelationships. It Is more like putlng a new entry Into a 
table or a new simple assertion Into a data base. There Is no 
sharp distinction between levels three and four, but within any 
given system there will usually be two different ways of 
handling Information corresponding to this dlstlnctloa Let us 
look at the three areas of syntax. Inference, and semantics, and 
see how these different levels of knowledge relate to language 
understanding programs and the way they can learn. 
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£0*2 Syntax 

In syntax It Is clear that at the top level of knowledge 
there will be a basic approach to grammar, whether It be 
transformations, pattern matching, or finite state networks. In 
addition, there must be some sort of built In system to carry 
out the parsing. 

Some programs (such as the early translation programs) had 
the grammar built In as an Integral part of the system. In 
order to add new syntactic information It was necessary to dig 
Into the deepest Innards of the system and to understand Its 
details. It was recognized quTte early that this approach made 
them Inflexible and extremely difficult to change. The majority 
of language systems have Instead adopted the use of a "syntax- 
directed" parser. A grammar Is described by a series of rules 
which are applied by a uniform parsing procedure. In handling 
simple subsets of English, this turns grammar Into a third-level 
type of knowledge. We can add new single rules (for example, 
adding the fact that verbs can have a modifying adverb) In a way 
similar to adding words to a vocabulary - without worrying 
about the Interaction between rules. This simplicity Is 
deceptive, since It depends on t«e simplicity of context-free 
grammars for small subsets of natural language. Once we try to 
account for the complexities of an entire language with 
something like a systemic or transformational grammar, we must 
again pay attention to the complex interrelationships between 
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the rules, and the grammar becomes a tangled web Into which any 
new addition must be carefully fitted. For examples of the 
complexity of current transformational grammars for English, see 
<Kllma>. More recent programs which use transformational 
grammars <Woods 1970>, <Bobrow 1970> <Thorne 1968, 1969> 
recognize the fact that syntax Is not really that simple, and 
adopt a more Interrelated representation such as networks. 

In our system we have used programs to express the grammar, 
as explained In chapter 2. This Is not a return to the original 
first-level representation, since the grammar programs are 
completely separate from the system Itself. One of the 
arguments for using syntax-directed parsers was that the grammar 
rules could be expressed In a uniform way which did not depend 
on the details of the parsing program. Therefore changes could 
be made more easily and the grammar was expandable. By 
designing a special language for writing grammars, we can use a 
representation which Is just as general as syntax-rule tables, 
but which allows greater flexibility In designing a grammar, and 
relating It to semantics. 

How difficult Is It to change our grammar? For small 
changes (like allowing noun groups to contain only a number, as 
In "Are there any books? I want three.") only one or two 
additional lines of program would be needed. For a more 
substantial change (like addlna a new type of modifying clause) 
we might need as many as a dozen small additions to the grammar 
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In different places which would be affected. The first change 
could be done with little difficulty by anyone with an 
understanding of PROGRAMMAR and section 2.}. The second would 
take a deeper understanding of how the grammar Is written, but 
would still Involve only a small amount of programming, and of 
course would not Involve changing the basic system at all. The 
grammar was written to be fairly complete and with expansion In 
mind. It seems flexible enough that we will be able to Include 
as much of the the complexity of English as we want. 

What is Important In terms of learning Is that this Is 
level-two knowledge — It Is the type of knowledge which Is 
learned once In a lifetime by a person (or computer program), 
and should not need any major changes after childhood. 
Therefore although It must be changeable, we do not need to 
worry about "quick" learning techniques. If any learning Is 
studied at this level, we must deal In a sophisticated way with 
the methods used to learn large amounts of complex Interrelated 
material. Those computer programs which have "learned" syntax 
(<McConlogue> <Slklossy>) have done so by taking such an 
oversimplified view of syntax that the results hardly have 
significance for natural language. 

At level three of our knowledge of syntax, we have our 
knowledge of particular words, their grammatical categories and 
peculiarities. We need to ask. How easy Is It to add new words? 
How much do we have to know about the grammar to Increase the 
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vocabulary? In most systems there are a few words (such as 
"be", "there", or "than") which have complex and unique 
grammatical behavior. These are built Into the grammar 
Initially at level two. The rest of the vocabulary, like nouns 
and verbs, can be specified In a simple format. Our system Is 
no exception. To add the words "cat", "purple", and "walk" to 
the system, we would only need to know the right abbreviations 
(from section 2.3) to enter In LISP: 
(DEFLIST WORD (CAT (NOUN NS) ) (PURPLE(ADJ) ) (WAUUVB INF ITRNS))) 
This says that "cat" Is a singular (NS) NOUN, "purple" Is 
an ADJectlve, and "walk" Is the INFInltlve form of an 

InTRaNSItlve VerB. 

Can we give this Information In English? It would be 
straightforward to add the right terms to the vocabulary and set 
up simple PLANNER theorems which would allow us to say "'Cat' Is 
a noun." or "'Walk' Is an Intransitive verb." It would be an 
Interesting project to see how far this could be extended. Some 
programs have avoided giving dictionary entries to these "open 
class" words (like verbs, nouns, and adjectives) and let the 
parser determine their part of speech from context. <Thorne 
1969> This approach Is not generally meaningful for a complete 
language understanding system, since we need a dictionary of 
meanings. It could be used when adding new words to the 
system, and could be done so trivially In our Input programs, by 
assigning all unknown words to have all possible "open class" 
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grammatical features, then letting the parser choose the correct 
ones for the context. 
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5.1.3 Inference 

In the domain of Inference, there has been tremendous 
variation In how different systems treat knowledge. In the 
early programs, all of the complex Information was at level one 
(built Into the system), while the specific facts were at level 
four. As we have discussed, this made it very hard to modify or 
expand the complex Information held by the system. In the 
theorem proven, all of the complex Information was treated at 
the fourth level — as a set of Individual formulas which were 
treated as isolated facts. At level one, they have a uniform 
proof procedure as the heart of the system. We have discussed 
how this lack of Information at other levels (Information about 
the Interrelationships between different theorems) severely 
limits this approach. In our system, only simple assertions 
(such as "Noah Is the father of Jafeth.", or "Parent-of Is the 
converse of Chlld-of.") are dealt with at the lowest level. The 
rest of the knowledge is In the form of PLANNER theorems which 
have the ability to Include Information about their connections 
to other theorems. Some of these, such as the examples In 
section 3.1.3 about canaries and thesis evaluation, are at the 
third level, since they are not Interwoven Into complex 
relationships with other parts of the knowledge. Other 
theorems, such as the BLOCKS programs (section 3.1.) for keeping 
track of a table full of objects, are at level two. 

Again we can ask, how easy is It to add or change 
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Information at each of the levels. At the two ends, the answer 
is clear. At the top we have PLANNER and our committment to Its 
kind of theorem-proving procedures. Any change In this Is a 
major overhaul. At the bottom level, we have simple facts like 
"The red pyramid Is supported by the green cube." These are the 
facts which the system plays with whenever It Is conversing. 
They can be changed by simply telling Information (either In 
English or PLANNER), and are changed automatically when things 
happen In the world (for example If we move the red pyramid). 
The middle levels form the much more Interesting problem. 

At the second level we have our basic conceptual model of 
the world. This Includes our choice of categories for objects, 
ways of representing actions, time, place, etc. One of the 
benefits of PLANNER (and of LISP, In which It Is embedded) Is 
that we have a variety of useful facilities to represent our 
world efficiently. Section i.k described the BLOCKS world, and 
It should be similarly easy to define new worlds of discourse 
for the system (see below for examples). 

The third level presents the most Interesting problems for 
adding new Information to the system. It Is simple to do so In 
PLANNER by adding new theorems, but we would like to do It In 
English as well. Of the previous systems, the only ones which 
could accept complex Information In English were the theorem 
provers which dealt with It at the fourth level (as a set of 
unrelated formulas). In our sample dialog, we have some 
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examples of telling the system simple and slightly complex 
Information In English. Saying "I like blocks which are not 
red, but I don't like anything which supports a pyramid." 
created two theorems. The first says, "If you want to prove I 
like something. Prove that It Is a block and that It Is not 
red." This Is no different from a formula for any theorem 
prover, since It Is not related to the system In any complex 
way. The second theorem says, "If you are trying to prove that 
, like something, and you can prove that It supports a pyramid, 
then give up." This Interacts with the other goals and theorems, 
but In a very specialized way. 

Much smarter programs could be built to accept complex 
Information and use It to actually modify the PLANNER theorems 
aIr eady In the data base. For example, we might have a theorem 
to pick up a block, but It falls whenever the block has 
something on top of It. We would like to say In English, "When 
you want to pick up a block, first take everything off of It.", 
and have the system add this Information to the theorem In the 
form of an additional goal statement at the beginning. In order 
to do this, the system must have not only a model of the world 
it talks about, but also a model of Its own behavior, so that It 
can treat Its own programs as data to be manipulated and 
modified. This Is one of the most fascinating directions In 
which the system could be expanded. 

Another Is the possibility of letting the system learn from 
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experience. This Is a complex problem and can be dealt with at 
many levels. At a simplistic level, we can have It "learn" 
specific facts. For example, we have a theorem which proves 
that a block has Its top clear <by proving It supports nothing). 
As the last line of this, we have the PLANNER statement 
(THASSERT UCLEARTOP $?X )>, which says th« we should add to the 
data base the assertion that this block Is clear. If we then 
need the fact again, we don't need to repeat the deduction. In 
B sense the system has "learned" this fact, slnC e It has been 
added to the data base without being mentioned In the dialog. 
But In another sense. It hasn't learned any new Information, 
since nothing can be deduced with this fact that couldn't have 
been done before using the theorem that already existed. A more 
Interesting type of learning would be shown by changing- the 
PLANNER theorems for accomplishing a goal, depending on what had 
been achieved In the past. For example, we might have a goal 
statement with the recommendation (THTBF THTRUE) meaning try 
anything you can. If the goal Is achieved using some particular 
theorem, we might have the system change the recommendation to 
suggest trying that theorem first. At a more advanced stage, we 
would have a heuristic program which tried to figure out why a 
particular chain of deduction worked or didn't work in a 
Particular case. It would then modify the recommendations to 
choose the best theorems In whatever environments came up In the 
future. It might also recognize the need for new theorems In 
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some 



cases, and actually build them. This Is perhaps closest to 
human learning. It does not Involve juggling parameters or 
adding new Isolated bits of Information. Instead It Involves 
figuring out "How are my Ideas wrong tor right)?" and "How can 1 
change or generalize them?" It Involves a kind of "debugging" of 
Ideas, and Is a key reason for representing knowledge as 
procedures. 
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5< Lj Servant tes 

Since semantics Is the least understood part of language 
understanding. It is difficult to find a clear body of "level 
one" knowledge on which to base a system. Our system has a 
basic approach to semantics, explained In chapter k, but most of 
the semantic work Is done at level two - the Interrelated group 
of LISP programs for handling particular semantic jobs. At this 
level we have two separate areas of knowledge. The first Is 
knowledge about the language, and the way it Is structured to 
convey meaning. This Includes knowledge such as "In a passive 
sentence, the syntactic subject Is the semantic object.", "A 
definite noun group refers to fl particular object In the world 
model." or "'It' Is more likely to refer to the subject of the 
previous sentence than the object." This Is closely tied to the 
grammar, and Is about as hard to modify a5 the grammar programs 
themselves. The other type of level two knowledge Is the 
network of "semantic features" described In section 4.2. This 
Is peculiar to the domain being discussed, and becomes more 
complex as the range of discussion Increases. As we pointed 
out, this is currently separate from the network of "concepts" 
used for Inference by PLANNER, but the two could be combined. 
As with level two knowledge In other areas, this Is not 
something to be quickly learned and changed. Our knowledge of 
how language conveys meaning grows along with our knowledge of 
Its syntactic structure, and Is j ust as seldom modified. 
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At the third level we have the bulk of semantic Information 
- the meanings of Individual words. This Is the part which 
must be easy to change and expand. As In most language 
understanding systems, this knowledge Is In the form of separate 
dictionary entr.es, so that new words can be added without 
changing others. The definition of each word Is a program In 
the "semantic language" described In section U.2, and we gain 
great flexibility from this program form. The writer of 
semantic definitions does not have to be concerned with the 
exact form of the grammar, and If he wonts to enter simple 
words, he can use a standard function to describe them very 
simply. Most words can be added by using the functions CMEANS 
and NMEANS, or by using the particular simple semantic form 
appropriate to the type of word (for example, we would define 
"thirteen" by CCNUM 13))). If we come across a type of semantic 
problem or relationship we hadn't anticipated, or which Involves 
relating things In an unusual way, we can write a LISP function 
as the definition of the word to perform the required 

operations. 

We have tried to design our system so that It would be 
flexible and could be easily adapted to handle other fields of 
knowledge and to have a large vocabulary. It would be nice to 
enter new definitions In English Instead of having to use the 
special semantics language. In our sample dialog, the sentence 
"A "steeple" is a stack which contains two green cubes and a 
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pyramid." produced a new definition for a noun. This Is only 
possible when we can express the definition of the new word In 
terms of the old words and concepts. It Is a hit deceptive for 
a language understanding system to allow new words to be added 
so simply. if w e wanted to define the word "face", so that we 
could talk about the faces of blocks, the system would be 
lacking the basic concepts and relationships necessary to use 
the new word. This kind of knowledge | s at the second level, 
and we cannot expect to add It through a simple definition. 
There must be a powerful heuristic program which recognizes the 
need for a new concept and which relates this concept to the 
entire model of the world. In this example. It would have to 
realize that a face Is a part of an object, but Is not an object 
Itself. This might have varied consequences throughout the 
model, wherever relations such as "part" are Involved. 

Thus although our system can accept definitions of some 
words. It is a worthwhile but untried research project to design 
programs which will really be able to learn new words In an 
Interesting way. We believe that this will be much easier 
within the environment of a problem solving language like 
PLANNER, and that such programs could well be added to our 
system. 
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5,2 Directions for Future Research 
In the preface we talked about using computers In a new way. 
We speculated about the day when we will just tell our computer 
what we want done, and It will understand. This paper has 
described a small step In that direction. Where Is such 
research leading? What approaches should we take In the future? 
We can see three basic directions In which we could extend 
our system. First, at present It knows only about a tiny 
simplified subject. Second, most of what It knows has to be 
programed, rather than told or taught. Finally, we can't talk 
to It at all! We have to type our side of the conversation and 

read the computer's. 

The problem of widening the scope of knowledge Involves 
much more than building bigger memories or more efficient lookup 
methods. If we want the computer to have a large body of 
knowledge, the Information must be highly structured. The 
critical Issue Is to understand the kinds of organization 
needed. One of the reasons that our system Is able to handle 
many aspects of language which were not possible In earlier 
systems Is that I t has a deep understanding of the subject It Is 
discussing. There Is a whole body of theorems and concepts 
associated with the words in the vocabulary, and by making use 
of this knowledge In Its question-answering and action. Its 
language behavior is more like ours. In going to larger areas 
of discourse we cannot give up this Insistence that the computer 
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must know what It is talking about. 

We need a way to Integrate large amounts of heterogeneous 
knowledge Into a single system which can make use of U. At the 
same time, we cannot let the system become overburdened and 
Inefficient by Insisting on a stifling generality and 
uniformity. We want the advantages of specialized types of 
knowledge and structure that can come from limiting the subject 
to a small area, but at the same time we must have the 
flexibility that allows knowledge of different types to 
Interact. PLANNER-Uke languages may be a beginning toward 
these new kinds of organization. 

There are many different approaches which can be taken 
towards higher organization of knowledge. We may want to think 
In terms of a "block-structure" of contexts, each of which 
carries Its own special vocabulary and Information. We may 
think of a network. In which we can consider the "distance" 
between two concepts or words. It might be possible to deal 
with a set of specialized "subroutines" for dealing with 
different kinds of situations. Even for something as seemingly 
simple as chlldrens' stories, there are tremendous complexities 
and a well-structured approach Is necessary. 

In section ii.l.U we discussed some of the ways our system 
could take advantage of this large-scale structure of knowledge. 
The subject matter would Influence the choice of relevant 
definitions of words and appropriate theorems to be used In 
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deduction. This has been explored very little, and there are 
many possibilities for further research. 

The problem of learning Is of great Interest not only to 
those working on practical computer systems, but also to 
psychologists Interested In understanding how learning takes 
place In other Intelligent systems, such as people. We need to 
understand how the amount of knowledge we already have affects 
che amount and the way we can learn. Working on a natural 
language program offers several advantages for studying problems 
of knowledge and learning. Language represents a body of highly 
complex knowledge, which Itself con provide a rich field for 
learning tasks with a wide range of difficulties. Also, 
language Is a major vehicle through which people learn about the 
world. In studying the way that a computer could accept new 
Information in natural language, we are studying a key area In 
learning. We need to understand the ways In which learning 
depends on the organization of our knowledge. We need to 
explore In what ways knowing about Its own mentality could allow 
a computer to really learn. This Is perhaps the most 
Interesting possibility for research, and we have discussed It 

at length In Section 5.1. 

We have discussed the difficulties involved In accenting new 

declarative knowledge In any but a superficial way. One of the 
problems most closely associated with this Is the use of world- 
knowledge In understanding declarative sentences. Compare the 
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sentences: 

I put the heavy book on the table and It broke. 

I put the butterfly wing on the table and It broke* 

The understanding of the referent of the pronoun "It" must 
depend on the likelihood of the different objects breaking or 
causing breakage. This could be handled by having the 
declarative sentence "Interpreter" try out both Interpretations 
and see which leads to more "reasonable" conclusions. 

Out system can currently do this only If the knowledge of 
the world needed Is a specific simple fact ("there Is no block 
In the box.") or a categorical fact ("table can't pick up 
blocks.") A more complex system Is needed to accept general 
declarative statements and explore their consequences. It must 
seek the Interpretation which is neither trivial nor 
Incongruous, but which provides new Information as the speaker 
must have Intended It to. Contextual factors play the major 
role. The expectations might be completely reversed If the 
sentence were preceded by "The strangest thing just happcnedl" 

Finally we have the problem of speech communication with 
computers. Again the Issue Is not one of more efficient 
hardware, but one of knowledge. Spoken language calls on the 
listener to fill In a great deal from his own knowledge and 
understanding. Words, phrases and whole Ideas &re conveyed by 
fragments and mumbles which often serve as little more than a 
clue as to what they Intend, The need for a truly vertical 
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system Is much greater for speech than for written language. 
The analysis at even the lowest level depends on whether the 
result "makes sense." People can communicate under conditions 
where It Is nearly Impossible to pick out Individual words or 
sounds without reference to meaning. 

In our system we tried to Integrate the syntactic, semantic 
and deductive programs in a flexible way. We allow meaning to 
guide the direction of the parsing. Our semantic Interpretation 
Is guided by logical deduction and a rudimentary model of what 
the speaker knows. For spoken language this must be expanded. 
Perhaps we might look for fragments of sentences and use their 
meaning to help piece together the rest. Or possibly we could 
create a unified system In which the deductive portion could 
look at the context and propose what It thought the speaker 
might be saying, on the basis of meaning, and the audible clues 
In the utterance. It might be possible to have a more multi- 
dimensional analysis In which prosodlc features such as voice 
Intonation could be used to recognize important features of the 
utterance. This Is not at all saying that we should throw 
syntax overboard In favor of some sort of vague rplatlonal 
structure. Often the most Important clues about what Is being 
said are the syntactic clues. What Is neeeded Is a grammar 
which can look for and analyze the different types of Important 
patterns rather than getting tremendously Involved with finding 
the exact details of structure In a fixed order. Systemic 
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grammar is a seep in this direction, and the use of programs for 
grammars gives the kind of flexibility which would be needed for 
doing this kind of analysis. It Is not clear whether our system 
In Its present form could be adapted to handle spoken language, 
but Its general structure and the basic principles of Its 
operation might well be used. 

The challenge of programming a computer to use language Is 
really the challenge of producing Intelligence. Thought and 
language are so closely Interwoven that the future of our 
research In natural language and computers will be neither a 
study of linguistic principles, nor a study of "artificial" 
Intelligence, but rather an Inquiry Into the nature of 
Intel 1 Igonce I tself . 
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Appendix A - Index of Syntactic Features 
Underlines Indicate primary description of feature. 



-OB lb8 

-0B2 U8 

ACTV 116, 119, 137, 138 

ADJ 120, 121, 132, 119 

ADJG 121, 121-132, 139 

AOJQ 105, 107 

ADJREL 105 

ADJUNCT 105, 109, 128-131 



ADV 139-lkl 

ADVADV HI 

ADVMEASQ 105, 107 

ADVMEASREL 105 

AGENT 116, 119, 128, 129 

AND 150, 151 

AS 131 

AUX 11(6 

BE 116, 137 

BINDER Oil 

BOTH 150 

BOUND 105, 109 

BUT NOT 150 

CLASF 120, 121, 12b, 1M 

CLAUSE lOb-119,, 121, l«* 

CLAUSEADV Ibl 

COMP 117-118, 123, 126-128, 

COMPAR 121, 131, 132, 139 

COMPONENT 151 

COMPOUND 1U9-152 

COMPQ 105, 108 

COMPREL 105, 111, 113 

DANGLING 105-107, 111, 113, 

DECLARATIVE 105, 106 

DEF 123, 12», H2 

DEFPOSS 123, 127 

OEM 123, 12U, lii2, lb5 

DET 120, 122-12b, iil-lii 

DOWNQ 105, 109 

DOWNREL 105, 111 

DPRT 116, 119 

EN 105, 110, 137, 138, lb8 



FINITE 137 

FUTURE 133, 13U 

IMPER 137 

IMPERATIVE 10b, 105, 106 

INCOM 123, 125, lb2 

INOEF 123, 12b, 125, Mr} 

I NO 105, 109-113, 137, 138, US 

ING0B2 lb8 
INGQ 105 
INGREL 105 
INT 116, 118, 1H8 
IT 116, 119 
ITRIIS 116, 117, lbB 
ITRNSL 116, 118, Ul 
ITSUBJ 105, 113, 119 
LIST 150 
LISTA 150 

LOBd 105, 117, 126, 129 
LOBJQ 105 
LOBJREL 105 
MAJOR 105, 106, 107 
MAS5 1U2, lb3 
lb8 MEASQ 105, 108 

MEASREL 105, 111 
131 MODAL 133, 13b, lbB 

NDET 123, 12b 

NEED2 lb5 

NEG 123, 125, 137, 138, 142 

NFS 123, 127, lbS 

NG 12SL-U1, 128-129, 132 
129 NQQ 107 

NOBJ 123 

NONUM lb2, lb3 

NOR 150 

NOUN 120, lii 

NPL 123, 127, Ib2-lb5 

NS 123, 127, H2-H5 
NUM 120, lii 
NUMD 123, ikJt 
NUMDALONE lbb 
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NUMDAN 144 


QAOJ lift 


NUMDAS 144 


QAUX 146 


NUMDAT 144 


OOET 142 


NUMDET 125 


QNTFR 123, 124, 125, 142 


DBJ 105, 126, 145 


QUEST 123, 125, 128-132, 145 


OBJ1 105, 112, 123 


OUESTION 105-107, 146, 146 


OBJ1Q 105, 108 


RFLAOJ 111, 113 


OBJ1REL 105, 111 


RELDEL 105, 111 


OBJ1UPREL 113 


RELPREPG 128, 129 


OBJ2 105, 112, 123 


REPOB 148 


OBJ2Q 105, 108 


REPORT 105, 112, 148 


OBJ2REL 105, 111 


RSNG 105, 109-113, 119, 128, 148 


OBJQ 105 


RSQ 105, 109, 121 


0BJREL 105 


SEC 105, 106 


OF 123, 125, 128, 129 


SHORT 105, 107, 129 


OFD 142 


SHORTREL 111, 113 


OFD-INCOM US 


SUBING 105, 112, 14B 


OFOBJ 123, 126 


SUBJ 105, 112, 117, 123, 126, 145 


OR 150 


SUBJFORK 151 


ORD 120, 122, 144 


SUBJQ 105, 107 


PAST 133, 134, 148 


SUBJREL 105. Ill 


PA5V 116, 119, 155, 137, 136 SUBJT 116. 123 


POSES 123, 125, 12G 


SUBJTQ 105, 108 


POSS 123, 126, 143, 145 


SUBJTREL 105, 111, 113 


POSSDEF 145 


SUBQ 105 


PREP 128, lii, 145 


SUBREL 105 


PREP2 145 


SUBTO 105, 112, 148 


PREPADV 141 


SUBTOB 148 


PREPG 121, 128-130 


SUP 121, 144 


PREPOBJ 105, 112, 123, 126, 


128 THAN 131 


PREPQ 105, 106, 107 


THAT 105, 112 


PREPREL 105, 110, 111, 113, 


129 THERE 108, 116 


PRESENT 133, 134 


TIM1 143 


PRON XjfcB 


TIME 123, 143 


PRONG 123, 124, 126, 127 


TIMEQ 105, 108 


PRONREL 145 


TIHEREL 105, 111 


PROPN 124, 145 


TO 105, 109-113, 137, 138, 148 


PROPNG 123, 124 


TOOEL 137, 138 


PRT 116, 118, 119, lii 


TPRON 124, 132, 1££ 
TPRONG 123, 124 


Q 120, 128, 129, 131 
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TRANS 116, 117, U8 
TRANS2 116, 117, UB 
TRANS2T0Q 105 
TRANS2TOREL 105 
TRANSL 116, 117, H8 
TRANST02Q 108 
TRANST02REL 111 
UPQUEST 128, 129 
UPREL 105, 113, 128, 129 
VJPS l>i8 

VB li£/ XM 

VBAD lfcl 

VBFORK 151 

VFS 1<*8 

VG Hl-iii, I 1 * 8 

VPRT 11*8 

WH- 105, 106, 107 

WHRS 105, 110, 115 

WORD 112-UlS 
Yes-Mo 105, 106 
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Appendix B - Sample Parsings 



YOU HADN'T LEARNED YOUR MOTHER'S RECIPE 



WAS 



USE IN 
WRONG? 



THE CAKE IF 



EGGS WOULD YOU HAVE BEEN GOING TO USE IN THE 
IF YOU HADN'T LEARNED YOUR MOTHER'S RECIPE WAS 

NGQUES POLR2 ACTV OBJ1Q TRANS) 



(WOULD 
((YOU) 

((HAVE 



((IN THE CAKE) 



((IF YOU 

(CLAUSE 

((IF (B 

((YOU) 

((HADN 



(((HOW MANY 
CAKE 

wrong: 
(clause major quest 
(((how many eggs) 
(ng quest howmany noef npl det) 

((HOW (QDETJ) (MANY (QDET)) (EGGS (NOUN NPL)))) 
(VB AUX MODAL QAUX)) •*<■>//* 

B?FN S rniM? E Tn N , S ,ccr L ! ((Y0U {PR0N NPL Ns SUBJ OBJ)))) 
BEEN GOING TO USE) (VG MODAL NAGR (FUT PAST MODAL ) 

((WOULD (VB AUX MODAL QAUX)) 

(HAVE (HAVE VB AUX INF TRANS)) 

(BEEN (AUX VB BE EN)) 

(GOING (VB ITRNS INQJ) 
(TO (TO)) 

(PREPG) <USE (VB ' NF TRANS MVB)),) 

((IN (PLACE PREP PLACE)) 
((THE CAKE) 

(NG OBJ DET NS DEF) 

((THE (DET NPL NS DFFJ1 (CAKF (unuti u*\\\\\s 

INDER)) 

■t N le5rSed? EF NS m) ((Y0U (PR0N npl ns subj 0BJ) >» 

(VG VPL V3PS MEG (PAST PAST)) 

((HADN'T (HAVE VB AUX TRANS PAST VPL V3PS VFS NFR>) 
, (LEJRNED (VB TRANS REPOB PAST EN MVB)))) ' 

(YOUR MOTHER'S RECIPE WAS WRONG) 

5/J«m^ HH REP0RT 0BJ 0BJ1 DECLAR BE INT) 
(((YOUR MOTHER'S RECIPE) 

(NG SUBJ NS DEF DET POSES) 

(((YOUR MOTHER'S) 

(NG SUBJ NS DEF DET POSES POSS) 

(((YOUR) (NG SUBJ POSS) 

(u <vrurr,.£ (Y0UR <PR0N NPL NS SUBJ OBJ POSS)))) 
(MOTHER'S (NOUN NS POSS)))) ™^,ni 

(RECIPE (NOUN NS)))> 

((WAS) (VG V3PS VFS (PAST)) 

t/MiMMt'tft! UUX VB BE V5PS VFS PAS T MVB)))) 
((WRONG) (ADJG Q COMP) ((WRONG (ADJ) ))))))))) ) 
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TABLE. 

<«>« rssi'ss nuc k-sts aWiM-wr 

THE TABLE) 

RSSSWlSB! KJlC^PRT VB Iff TRANS «VB>») 

IVLvTHUlG GREEN /, AT LEAST THREE OF THE BLOCKS /, AND 
EITHER A BOX OR A SPHERE WHICH IS BIGGER THAN ANY 
BRICK ON THE TABLE) 

(NG OBJ OBJ1 EITHER COMPOUND LIST NS) 

M (ANYTHING GREEN, ( NG^OW »*\^ffSm I ««EEH (A0 J »,> 

((AT LEAST THREE OF THE BLOCKS) 
(NG OBJ OBJl COMPONENT NUMD HUM NPL OET OF) 

((AT (AT)) 
(LEAST (NUMD NUMDAT)) 

(THREE (HUM)) 
((OF THE BLOCKS) 

(PREPG OF) 

((OF (PREP)) 
((THE BLOCKS) 

(<THE B <DET T Hp[ L MS E D EF» (BLOCKS (NOUN NPL) )))))) ) 
((A BOx'ol A SPHERE WHICH IS BIGGER THAN ANY BRICK ON THE 

(NG^OBJ OBJ1 COMPONENT OR COMPOUND BOTH NS) 
(((A BOX) (NG OBJ ObJl COMPONENT DET NS INDEF) 
U(A HUXI JNb INDEF)) (BOX (NOUN NS)))) 

((A SPHERE WHICH IS BIGGER THAN ANY BRICK ON THE TABLE) 

(NG OBJ OBJ1 COMPONENT DET NS INDEF) 

((A (DET NS INDEF)) 

ftSSfcH {FSWKfmAH ANV BRICK ON THE TABLE, 

I^ChWrEL^DE^PU UHHtCH .NPL,,,, 

((IS) (VG V3PS (PRES)) 

((IS (AUX VB BE V3PS PRES MVB) )) 
((BIGGER THAN ANY BRICK ON THE TABLE) 
(ADJG Q COMP COMPAR THAN) 
((BIGGER (ADJ COMPAR)) 
(THAN (THAN)) 
((ANY BRICK ON THE TABLE) 
(KG SUBJ COMPAR DET NS QNTFR) 
((ANY (DET NS NPL QNTFR)) 
(BRICK (NOUN NS)) 
((ON THE TABLE) 
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(PREPG Q) 

((ON (PREP PLACE)) 
((THE TABLE) 
(NG OBJ OET NS OEF) 
((THE (DET NPL NS DEF)) 
(TABLE (NOUN NS) ))))))))))))))))))) ) 
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Appendix C - Sample BLOCKS Theprems 



(DEFTHEOREM 



TA-CONTAIN 
(THANTE (X 



Y Z) 
(#AT S?X ?) 
(THGOAL UMANIP $?X)> 
(THGOAL ('SUPPORT $?Y $?X)J 
(THOR (THAND (THGOAL COS *?Y 'BOX)) 
{THVSETQ $_Z $?Y)) 
(THGOAL ('CONTAIN $?Z $?Y))> 
(THASSERT ('CONTAIN S?Z $?X)))) 



(DEFTHEOREM TC-MAKESPACE ^ ^ % , 

DspftCE s7SUKF $?S|ZE j ?0 BJ $?SPACE) 
(THNOT (THGOAL WIS $?SURF #BOX))> 
(MEMORY) 

(THAND (THGOAL ('SUPPORT $?SURF S_X)) 
(THGOAL CGET-RID-OF 5?X) 

(THUSE TC-GET-RID-OF))) 



(THOR (THGOAL CFINDSPACE 



(THUSE TC-F 
(THGO TAG)) 
(MEMQREND ('MAKESPACE 



$?SURF 
S?SIZE 
$?OBJ 
$?SPACE) 
INDSPACE)) 



J?EV $?SURF))>) 



(DEFTHEOREH 



TC-MORE 
(THCONSE 



(MEASURE X Y) 

(•MOPE S7MEASUPE $?X $?Y) 

(THVSETQ 5_MEASURE 

(GET J7MEASURE (QUOTE 

(GREATERP (S7MEASURE S?X) 

($?KEASURE $?Y)))) 



MEASF"))) 



(DEFTHEOREM 



TC-OH 
(THCOUSE 



(X Y Z) 

(THOR (THGOAL ('SUPPORT S?Y S?X>> 
(THAND (THASVAL S?X) 

(THGOAL ('SUPPORT $_Z $?X)) 
(THGOAL CON $?Z 5?Y) 

(THUSE TC-ON)))))) 



Appendix C - Page 451 



(DEFTHEOREM TC-PICKUP 

(THCONSE (X (WHY (EV)) EV) 
('PICKUP S?X) 
(MEMORY) 

Kl iflgfSJSSi (THUSE TC - r5RASP>> 

(THNOOB) 
,.«r»„ (THUSE TC-RAISEHANO)) 

(MEMOREND ('PICKUP S7EV S?X)))) 

(DEFTHEOREM TCT-PICKUP 

(THCONSE (X EV TIME) 

('PICKUP t?X S7TIME) 

(THOR (THAND (THGOAL («PICKUP$?EV $?X)) 
(TIMECHK $?EV 57TIME)) 
(THGOAL ('PICKUP S?EV $?X $?TIME) 
(THUSE TCTE-PICKUP))))) 

(DEFTHEOREM TCTE-PICKUP 

(THCONSE (X EV EVENT TIME) 

("PICKUP $?EV J?X $?TIME) 

(THOR (THAND (THGOAL ('PICKUP $?EV $?X)1 

(THSUCCEEDJ. ""*« *"* •*""»» 

(THAMONG S7EVENT EVENTLIST) 

(MEMQ (GET S7EVENT CQUOTE TYPE)) 

,.,,. (f ^ U0TE ('PUTON 'GET-RID-GF))) 

(TIMECHK S7EVENT J7TIME) 

(THOR (THGOAL (#PUTON *7EVENT S?X ?)) 

(THGOAL ('GET-RID-OF S7EVENT »HH 
I"V8fTQ $_EV (MAKESYM (QUOTE F))) 
(AND (PUTPROP S?EV 

(PUTPROP J?EV 

(GET J7EVENT (QUOTE END)) 
(QUOTE START)) 
(QUOTE END)) 



nnlnS^ 5?EV ((}U0TE PICKUP) (QUOTE TYPE)) 
PUTPROP S7EV 57EVENT (QUOTE WHY)) 

SETQ EVENTLIST (CONS $?EV EVENTLIST)) 
(THASSERT (-PICKUP S ?EV S?X)>)) " 



(DEFTHEOREM TE-CONTAIN (THERASING (X Y) 

CAT S?X ?) 



(THGOAL ('CONTAIN 4_Y S7X)) 
(THERASE ('CONTAIN *?Y $?X))) 
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Appendix D - Sample PROGRAMMER PrOEra* 
Excerpt from Noun Group Parser 

?COND ((ISO H NS) (FQ NS)) tT (PQ WOU 
(OR NN (AND (FQ NUMBER) (CO IHCOH))) 

NUMBER 

(FQ DET) 

(CNQ OF) OF ADJ) 

QNUM 

((ISO H NONUM) OF NIL) 

((CQ NPL) (RQ NS))) 

NIL 
(NUMD) 

(<EQ°CCADDR (MB HIS CQ NO)) ADJ NIL) 

?(AND (NQ OF) (PARSE PREPG OF)) SM0F NIL) 
UEQ (CADDR (NB H)) <Q NONE)) IMCOM ADJ) 

SMOF 

((OR°SMN (SMNGOF) (NOT (POP))) RETSM INCOM) 

((PARSE ADO) ADJ CLASF NIL) 

((OR (ISC H SUP) (ISQ H COMPAR)) NIL REDUC) 

(AND (EQ (CADOAR M) (Q Of)) 

lof^M — l (AND <ERT NOUN SMNGOF3, <r.O FAIL))) 

(FQ OF) 
(GO RETSM)) 

(GO INCOM) 

RSPtMIM VB ING (CLASP,, (PARSE VB EN (CLASF,) (PARSE CLASP,, 

CLASF 

NIL 

REDUC) 

&w5WW(ii8Tt« h tii«)» «* «m 

IcoJS HB'CIJJ « "ASS, (OR (CQ PART, (NOT (CQ OET,,,) 
(COND ((NOT^ISq'h "PL)) <RQ NPL "ART))) 

\IZ {IKS 88 ?cS 5 oet!W!cq n UMD „, cyo krl MASS))) 
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((MEET FE 
SMNG 
((OR (NOT 

(OR SMN (SMNG1) (ANd'(ERT 

((NOT 

((AND 



(Q (NS NPL PART MASS))) NIL REDO) 
(CQ OFOBJ)) (CQ 



DET)) NIL FAIL) 

NOUN SMNG1) (GO FAIL))) 



CISQ H POSS)) NIL POSS'RETSM) 
(NEXTW TO) 

(MEET FE (Q (COMP SUBJ))) 
(PARSE CLAUSE RSQ TO) 

RET5M SM " (ShRS ° (SM H) SM> <G0 P0PRE T))) 

NIL) 

((AND (OR (NEXTW AS) (NQ COM PAR) 5 

(PARSE ADJG THANNEED) 
(OR SMN 

(AND (NULL N) 
(CQ SUBJ) 

(ISQ {• C PV) AUX> 
(ISQ PT BE) 
(GO POPRET)) 
(SMPREP (QUOTE ADJ)) 
(GO POPRET))) 
RETSM 
NIL) 
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Appendix E - Sample Dictionary Entries 

Syntactic Definitions 

(defl 1st word 

(LITTLE (AOJ)) 

{LONG (ADJ)) 

(MAKE (VB INF TRANS)) 

(HOVE (VB INF TRANS)) 

(NAME (NOUN NS VB INF TRANS)) 

(NARROW (ADJ)) 
(NICE(ADJ)) 
(OBJECT (NOUN NS)) 

(OFF (PRT)) 

(OUT (PRT)) 

(PICK (VPI1T VB INF TRANS)) 

(PUT (INF PAST VB TRANSL VPRT) ) 

(POINTED (ADJ)> 
(PYRAMID (NOUN NS ) ) 

(RED (AOJ)) 

(RELEASE (VB TRANS INF)) 
(RIGHT (NOUN NS)) 
(ROUND (AOJ))) 

CDEFUST (BE-GAN (BEG.N (PAST) C.NF))) 
(GAVE (GIVE (PAST) (IMF))) 
(SAW (SEE (PAST) (INF))) 
(TOLD (TELL (PAST) (IMF)))) 
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Semantic Definitions 



(DEFLIST SHIJTC 

(A ((OFT T))> 

(ABOVE ((PREP ((T (UOC 'ABOVE T)))))) 

(AFTER ((BINDER (END NIL)))) 

(ALL ((DET (COND ((CQ OF) (Q ALL)) 

((MEET (Q (HUH DEF)) FE) (Q DEF)) 
((Q NOET)))))) 
(BALL ((NOUN (NMEANS (('MANIP GROUND) 

(CIS *** #BALL})))))) 
(BIG ((MEASURE (('SIZE CPHYSOB) T))) 
(ADJ (NMEANS (CPHYSOB 'BID 

(CMORE 'SIZE »•« (200 200 

(BLACK ((ADJ (-COLOR (BLACK)))) ZOO)))))))) 

(BLOCK ((NOUN (NMEANS ( ( 'MAN I P 'RECTANGULAR) 

(CIS ••• IBLOCK))))))) 
(BLUE ((ADJ ('COLOR 'BLUE)))) 

(BY ((PREP (<T (CMEANS (((CPHYSOB)) (CPHYSOB))) 

CNEXTO *l #2 *TIME) 
NIL))))))) 
COLOR ((NOUN (NMEANS (('COLOR) (CIS *•* 'COLOR))))))) 
(CONTAIN ((VB ((TRANS (CMEANS (((('BOX)) (CPHYSOB))) 

('CONTAIN '1 '2 -TIME) 
NIL) 

((((•CONSTRUCT)) 

(('THING))) 

('PART »2 *1 •TIME) 

(CUBE ((NOUN (NMEANS ( ( 'MAN I P 'RECTANGULAR) 

(CIS *•* 'BLOCK) 
CEQDIM ••*))))))) 
(EVERYTHING ( (TPRON (QUOTE ALL)))) 
(FEWER ((NUMD (LIST (Q <) NUM))>) 
(FOUR ((NUM i.))) 

(FRIEND ((NOUN (NMEANS (('PERSON) 

(CIS ••* 'PERSON))))))) 
(GRAB ((VB ((TRANS CGRASP)))))) «»«"""» 

(GRASP ((VB ((TRANS CGRASP)))))) 

(I ((PRON (SETQ SM (Q (FRIEND)))))) 

(IT ((PRON (SMIT (Q IT))))) 

(NICE ((ADJ (NMEANS (('THING) 

(CLIKE :FRI£ND ***))))))) 
(NOW ((ADV (OR (EQ (CADR (ASSQ (QUOTE TIME) FE)) 

(QUOTE :NOW)) 
(ERT NOW DEFINITION))))) 



Appendix E - Pa*e U56 



{THE <(DET T))) 

(THEY ((PRON (SUIT (QTHEY))))) 

(THICK ((ADJ (NMEANS ((#PHYSOB) {(*MORE #THICKNESS 

(0 200 0))))>) 
(MEASURE ((#THICKNESS CPHYSOB) T))))) 
(THIN ((ADJ (NMEANS (<#P1lYSOB) (UMORE #TH|CKNESS 

*••))))> 

(MEASURE t(#THICKNESS (#PHYSOB) NIL))))) 
(THING ((NOUN (NMEANS ((#THING #VAGUE *PHYSOB) 

((#|S •** ?))))))) 
(WHO ({PRONREL (SETQ SM (QUOTE (ANIMATE)))) 
(PRON (SETQ SM (QUOTE (ANIMATE)))))) 

(WHY ((QADJ (FQ WHY))))) 
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Appendix F - PLANNER Data for Dialog In Section 1,3 



C * I S 

CIS 
(»IS 
CIS 
(•IS 
CIS 
(MS 
(MS 
(MS 
IS 
IS 
IS 
IS 
IS 
IS 



:B1 
:B2 
:B3 
:Bi* 
:B5 
:B6 
:B7 

:B10 



C 

C 

C 

C 

(* 

(' 

(MS 

(MS 

(MS 

(MS 

CAT 

CAT 

CAT 

CAT 

CAT 

CAT 

CAT 

CAT 



•BLOCK) 
•PYRAMID) 
•BLOCK) 
'PYRAMID) 
•PYRAMID) 
•BLOCK) 
•BLOCK) 
•BLOCK) 



•RED •COLOR) 

•BLUE *C0L0R) 

•GREEN 'COLOR) 

•WHITE 'COLOR) 

•BLACK #C0L0R) 

•RECTANGULAR •SHAPE) 

•ROUND 'SHAPE) 

•POINTED 'SHAPE) 

:SHRDLU •ROBOT) 

--FRIEND •PERSON) 

tHAND 'HAND) 

:D1 (100 100 0)) 

(100 100 100)) 
(U00 0)) 
(6U0 640 1)) 
(500 100 200)) 
(0 300 0}) 
(0 2b0 300)) 



:B2 
:B3 
:BU 
:S5 
:B6 
:B7 
BIO 



(300 6b0 0)) 
(•SUPPORT :B1 :B2) 

(•SUPPORT :B3 :B5) 
(•SUPPORT :B6 :B7) 
(•CLEARTOP :B2) 
(•CLEARTDP :Bii) 
CCLEARTOP :B5) 
CCLEARTOP :B7) 
(•CLEARTOP ;B1C) 



(•MAN IP 

(•MAN IP 
('MAN IP 
(•MAN IP 
(•MAN IP 



:B1) 
:B2) 
:B3) 
:B4) 
:B5) 



( "MAN I P 

( iHAN I P 

(•MAN IP 

(•SUPPORT 

(•SUPPORT 

(•SUPPORT 

(•SUPPORT 

(•SUPPORT 

(•SUPPORT 

(•AT :BOX 

CIS :BOX 



36) 

B7) 

BIO) 
: TABLE :B1) 
:TABLE :B3) 
:BOX :8b) 
:TABLE :B10) 
:TABLE :BC) 
:TABLE : BOX ) 
(600 600 0)) 
•BOX) 



(MS :TABLE STABLE) 
(•CONTAIN :BOX :Bk) 
(•SHAPE :B1 'RECTANGULAR) 
(•SHAPE :B3 'RECTANGULAR) 
(•SHAPE :B2 'POINTED) 
(•SHAPE iSU 'POINTED) 
(•SHAPE :B5 'POINTED) 
(•SHAPE :B6 'RECTANGULAR) 
(•SHAPE :B7 •RECTANGULAR) 
(•SHAPE :B10 'RECTANGULAR) 
(•COLOR :B1 'RED) 
(•COLOR :B2 'GREEN) 
(•COLOR :B3 'GREEN) 
('COLOR :B(i 'BLUE) 
(•COLOR :B5 'RED) 
(•COLOR :B6 'REO) 
(•COLOR :B7 'GREEN) 
(•COLOR :B10 'BLUE) 
(•COLOR :BOX 'WHITE) 
(•COLOR :TABLE 'BLACK) 
('CALL :SHRDLU SHRDLU) 
('CALL rFRIEND YOU) 



Some of 
deduced 



the data Is entered Initially. The rest can 
and asserted by simple antecedent theorems. 



be 
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